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GOODPASTURE ANTIGEN BINDING PROTEIN 

Cross Reference 

This application claims priority to U.S. Provisional Patent Application Serial No. 
60/1 21 ,483, filed February 24, 1 999. 

Statement of Government Rights 

This work was supported in part by Grants SAL91/0513, SAF94/1051 and 
SAF97/0065 from the Plan Nacional I+D of the Comision Interministerial de Ciencia 
Tccnologia (CICYT, Spain), Grant 93/0343 from Fondo de Investigaciones Sanitarias 
(FISss, Spain) and Grants GV-3166/95, GV-C-VS-21-1 18-96 from la Direccio General 
d'Enscnyaments Universitaris i Investigacio (Comunitat Valenciana, Spain); therefore 
the Slate of Spain may have rights in the invention. 

Field of the Invention 

The invention relates to the fields of protein kinases, automimmune disease, 
apoptosis. and cancer. 

Background of the Invention 

Goodpasture (GP) disease is an autoimmune disorder described only in humans. In 
GP patients, autoantibodies against the non-collagenous C-terminal domain (NCI) of the 
type IV collagen a3 chain ("Goodpasture antigen") cause a rapidly progressive 
glomerulonephritis and often lung hemorrhage, the two cardinal clinical manifestations of 
the GP syndrome (see 1 for review. The reference numbers in this section correspond to 
reference list of Example 1). 

The idea that common pathogenic events exist at least for some autoimmune 
disorders is suggested by the significant number of patients displaying more than one 
autoimmune disease, and also by the strong and common linkage that some of these 
diseases show to specific MHC haplotypes (31, 32). The experimental observation that the 
autoamigen is the leading moiety in autoimmunity and that a limited number of self- 
components are autoantigenic (31), suggest that these self-components share biological 
features with important consequences in self/non-self recognition by the immune system. 



WO 00/50607 



2 



PCT/IB00/00324 



One possibility is that triggering events, by altering different but specific self-components, 
would result in abnormal antigen processing. In certain individuals expressing a particular 
MHC specificity, the abnormal peptides could be recognized by non-tolerized T cells and 
trigger an immune response (1 ): 

We have previously explored the GP antigen to identify biological features of 
relevance in autoimmune pathogenesis. Since the NCI domain is a highly conserved 
domain among species and between the different type IV collagen a chains (al-a6) (2), 
the exclusive involvement of the human a3(IV)NCl in a natural autoimmune response 
suggests that this domain has structural and/or biological peculiarities of pathogenic 
relevance. Consistent with this, the N-terminus of the human antigen is highly divergent, 
and it contains a unique five-residue motif (KRGDS 9 ) that conforms to a functional 
phosphorylation site for type A protein kinases (3, 4). Furthermore, the human a3 gene, 
but not the other related human or homologous genes from other species, is alternatively 
spliced and generates multiple transcripts also containing the phosphorylatable N-terminal 
region (5-7). Recent studies indicate that the phosphorylation of the N-terminus of the GP 
antigen by cAMP-dependent protein kinase is up regulated by the presence of the 
alternative products (see Example 3 below). Specific serine phosphorylation and pre- 
mRNA alternative splicing are also associated with the biology of other autoantigens 
including the acetylcholine receptor and myelin basic protein (MBP) (4). The latter is 
suspected to be the major antigen in multiple sclerosis (MS), another exclusively human 
autoimmune disease in which the immune system targets the white matter of the central 
nervous system. GP disease and MS are human disorders that display a strong association 
with the same HLA class II haplotype (HLA DRB1*1501)(32, 33). This, along with the 
recent report of death by GP disease of an MS patient carrying this HLA specificity (34), 
supports the existence of common pathogenic events in these human disorders. 

Thus, specific serine/threonine phosphorylation may be a major biological 
difference between the human GP antigen, the GP antigens of other species, and the 
homologous domains from the other human a(IV) chains, and might be important in 
pathogenesis (1, 4). 

Therefore, the identification and isolation of the specific serine/threonine kinase 
that phosphorylates the N-terminal region of the human GP antigen would be very 
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advantageous for the diagnosis and treatment of GP syndrome, and possibly for other 
autoimmune disorders. 

5 'Summary of the Invention 

The present invention fulfills the need in the art for the identification and isolation 
of a serine/threonine kinase that specifically binds to and phosphorylates the unique N- 
terminal region of the human GP antigen. In one aspect, the present invention provides 
nucleic acid sequences encoding various forms of the Goodpasture antigen binding protein 
10 (GPBP), as well as recombinant expression vectors operatively linked to the GPBP- 
encoding sequences. 

In another aspect,' the present invention provides host cells that have been 
transfected with the recombinant expression vectors. In a further aspect, the present 
invention provides substantially purified GPBP and antibodies that selectively bind to 

1 5 GPBP. In still further aspect, the invention provides methods for detecting the presence of 
GPBP or nucleic acids encoding GPBP. 

In a further aspect, the present invention provides methods for detecting the 
presence of an autoimmune condition or apoptosis, which comprises detecting an 
increase in the expression of GPBP in a tissue compared to a control tissue. 

20 In another aspect, the present invention provides methods and pharmaceutical 

compositions for treating an autoimmune disorder, apoptosis, or a tumor, comprising 
modifying the expression or activity of GPBP in a patient in need thereof. 

Brief Description of the Figures 

25 Figure 1. Nucleotide and derived amino acid sequences of n4\ The denoted 

structural features are from 5 1 to 3'end: the cDNA present in the original clone (HeLal) 
(dotted box), which contains the PH homology domain (in black) and the Ser-Xaa-Yaa 
repeat (in gray); the heptad repeat of the predictable coiled-coil structure (open box) 
containing the bipartite nuclear localization signal (in gray); and a serine-rich domain 

30 (filled gray box). The asterisks denote the positions of in frame stop codons. 

Figure 2. Distribution of GPBP in human tissues (Northern blot) and in 
eukaryotic species (Southern blot). A random primed 32 P-labeled HeLal cDNA probe 
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was used to identify homologous messages in a Northern blot of poly(A + )RNA from 
the indicated human tissues (panel A) or in a Southern blot of genomic DNA from the 
indicated eukaryotic species (panel B). Northern hybridization was performed under 
highly stringent conditions to* detect perfect matching messages and at low stringency 
in the Southern to allow the detection of messages with mismatches. No appreciable 
differences in the quality and amount of each individual poly A+ RNA was observed 
by denaturing gel electrophoresis or when probing a representative bloi: from the same 
lot with human (5-actin cDNA. The numbers denote the position and the sizes in kb of 
the RNA or DNA markers used. 

Figure 3. Experimental determination of the translation start site. In (A), the 
two cDNAs present in pc-n4' and pc-FLAG-n4' plasmids used for transient expression 
arc represented as black lines. The relative position of the corresponding predicted 
(n4*) or engineered (FLAG-n4') translation start site is indicated (Met). In (B), the 
extracts from control (-), pc-n4'(n4') or pc-FLAG-n4' (FLAG-n4') transfected 293 
cells were subjected to SDS-PAGE under reducing conditions in 10% gels. The 
separated proteins were transferred to a PVDF membrane (Millipore) and blotted with 
the indicated antibodies. The numbers and bars indicate the molecular mass in kDa and 
the relative positions of the molecular weight markers, respectively. 

Figure 4. Characterization of rGPBP from yeast and 293 cells. In (A), 1 jag 
(lane 1 ) or 100 ng (lanes 2 and 3) of yeast rGPBP were analyzed by reducing SDS- 
PAGi: in a 10% gel. The separated proteins were stained with Coomassie blue (lane 1) 
or transferred and blotted with anti-FLAG antibodies (lane 2) or Mabl4, a monoclonal 
antibody against GPBP (lane 3). In (B), the cell extracts from GPBP-expressing yeast 
were analyzed as in A and blotted with anti-FLAG (lane 1), anti-PSer (lane 2), anti- 
PThr (lane 3) or anti-PTyr (lane 4) monoclonal antibodies respectively. In (C), 200 ng 
of cither yeast rGPBP (lane 1), dephosphorylated yeast rGPBP (lane 2) or 293 cells- 
derived rGPBP (lane 3) were analyzed as in B with the indicated antibodies. In (D), 
similar amounts of H3 32 P0 4 -labeled non-transfected (lanes 1), stable pc-n4' transfected 
(lanes 2) or transient pc-FLAG-n4' expressing (lanes 3) 293 cells were lysed, 
precipitated with the indicated antibodies and analyzed by SDS-PAGE and 
autoradiography. The molecular weight markers are represented with numbers and bars 
as in Figure 3. The arrows indicate the position of the rGPBP. 
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Figure 5. Recombinant GPBP contains a serine/threonine kinase that 
specifically phosphorylates the N-terminal region of the human GP antigen. To 

assess phosphorylation, approximately 200 ng of yeast rGPBP was incubated with 
[y] 32 P-ATP in the absence (A and B) or presence of GP antigen-derived material (C). In 

5 (A), the mixture was subjected to reducing SDS-PAGE (10% gel) and 
autoradiographed. In (B), the mixture was subjected to 32 P-phosphoamino acid analysis 
by two-dimensional thin-layer chromatography. The dotted circles indicate the position 
of ninhydrin stained phosphoamino acids. In (C), the phosphorylation mixtures of the 
indicated GP-derived material were analyzed by SDS-PAGE (15% gel) and 

10 autoradiography (GPpepl and GPpeplAla 9 ) or immunoprecipitated with Mab 17, a 
monoclonal antibody that specifically recognize GP antigen from human and bovine 
origin, and analyzed by SDS-PAGE (12.5%) and autoradiography (rGP, GP). The 
relative positions of rGPBP (A), rGP antigen and the native human and bovine GP 
antigens (C) are indicated by arrows. The numbers and bars refer to molecular weightier 

15 markers as in previous Figures. 

Figure 6. In-blot renaturation of the serine/threonine kinase present in 
rGPBP. Five micrograms of rGPBP from yeast were in-blot renatured. The 
recombinant material was specifically identified by anti-FLAG antibodies (lane 1) and 
the in situ 32 P-incorporation detected by autoradiography (lane 2). The numbers and 

20 bars refer to molecular weight markers as in previous Figures. The arrow indicates the* 
position of the 89 kDa rGPBP polypeptide. 

Figure 7. Immunological localization of GPBP in human tissues. Rabbit 
serum against the N-terminal region of GPBP (1:50) was used to localize GPBP in 
human tissues. The tissues shown are kidney (A) glomerulus (B), lung (C), alveolus 

25 (D), liver (E), brain (F), testis (G), adrenal gland (H), pancreas (I) and prostate (J). 
Similar results were obtained using anti-GPBP affinity-purified antibodies or a pool of 
culture medium from seven different GPBP-specific monoclonal antibodies (anti-GPBP 
Mabs 3, 4, 5, 6, 8, 10 and 14). Rabbit pre-immune serum did not stain any tissue 
structure in parallel control studies. Magnification was 40X except in B and D where it 

30 was 100X. 

Figure 8. GPBPA26 is a splicing variant of GPBP. (A) Total RNA from 
normal skeletal muscle was retrotranscribed using primer 53c and subsequently 
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subjected to PCR with primers llm-53c (lane 2) or 15m-62c {lane 4). Control 
amplifications of a plasmid containing GPBP cDNA using the same pairs of primers 
are shown in lanes 1 and 3. Numbers on the left and right refer to molecular weight in 
base pairs. The region missing in the normal muscle transcript was identified and its 
nucleotide sequence {lower case) and deduced amino acid sequence {upper case) are 
shown in (B). A clone of genomic DNA comprising the cDNA region of interest was 
sequenced and its structure is drawn in (C), showing the location and relative sizes of 
the 78-bp exon spliced out in GPBPA26 (black box), adjacent exons (gray boxes), and 
introns (lines). The size of both intron and exons is given and the nucleotide sequence 
of intron-exon boundaries is presented, with consensus for 5' and 3' splice sites shown 
in bold case. 

Figure 9. Differential expression of GPBP and GPBPA26. Fragments 
representing the 78-bp exon (GPBP) or flanking sequences common to both isoforms 
(GPBP/GPBPA26) were 32 P-labeled and used to hybridize human tissue and tumor cell 
line Northern blots (CLONTECH). The membranes were first hybridized with GPBP- 
specific probe, stripped and then reanalyzed with GPB P/GPBP A26 probe. Washing 
conditions were less stringent for GPBP-specific probe (0.1% SSPE, 37°C or 55°C) 
than for the GPBP/GPBPA26 (0.1% SSPE, 68°C) to increase GPBP and GPBPA26 
signals respectively. No detectable signal was obtained for the GPBP probe when the 
washing program was at 68°C (not shown). 

Figure 10. GPBPA26 displays lower phosphorylating activity than GPBP. 

(A) Recombinantly-expressed, affinity-purified GPBP (rGPBP) (lanes 1) or rGPBPA26 
(lanes 2) were subjected to SDS-PAGE under reducing conditions and either Coomasie 
blue stained (2 jj.g per lane) or blotted (200ng per lane) with monoclonal antibodies 
recognizing the FLAG sequence (a-FLAG) or GPBP/GPBPA26 (Mabl4). (B) 200 ng 
of rGPBP (lanes 1 ) or rGPBP A26 (lanes 2) were in vitro phosphorylated without 
substrate to assay auto-phosphorylation (left), or with 5 nmol GPpepl to measure trans- 
phosphorylation activity (right). An arrowhead indicates the position of the peptide. (C) 
3 ^ig of rGPBP (lane 1) or rGPBPA26 (lane 2) were in-blot renatured as described 
under Material and Methods. The numbers and bars indicate the molecular mass in kDa 
and the relative position of the molecular weight markers, respectively. 
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Figure 11. rGPBP and rGPBPA26 form very active high molecular weight 
aggregates. About 300 jag of rGPBP (A) or rGPBPA26 (B) were subjected to gel 
filtration HPLC as described under Material and Methods. Vertical arrowheads and 
numbers respectively indicate* the elutioh profile and molecular mass (kDa) of the 

5 molecular weight standards used. Larger aggregates eluted in the void volume (I), and 
the bulk of the material present in the samples eluted in the fractionation range of the 
column as a second peak between the 669 and 158 kDa markers (II). Fifteen microliters 
of the indicated minute fractions were subjected to SDS-PAGE and Coomasie blue 
staining. Five microliters of the same fractions were in vitro phosphorylated as 

10 described in Materials and Methods, and the reaction stopped by boiling in SDS sample 
buffer. The fractions were loaded onto SDS-PAGE, transferred to PVDF and 
autoradiographed for 1 or '2 hours using Kodak X-Omat films and blotted using anti- 
FLAG monoclonal antibodies (Sigma). 

Figure 12. Self-interaction of GPBP and GPBPA26 assessed by a yeast two- 
15 hybrid system. (A) Cell transfected for the indicated combinations of plasmids were 
selected on leucine-tryptophan-deficient medium (-Trp. -Leu), and independent 
iransformants restreaked onto histidinc-deficient plates {-Trp t -Leu, -His) in the 
presence or absence of 1 mM 3-amino-triazole (3-AT), to assess interaction. The 
picture was taken 3 days after streaking. (B) The bars represent mean values in p- 
20 galactosidase arbitrary units of four independent P-galactosidase in-solution assays. 

Figure 13. GPBP is expressed associated with endothelial and glomerular 
basement membranes. Paraffin embedded sections of human muscle (A) or renal 
cortex (B, C) were probed with GPBP-specific antibodies (A,B) or with Mabl89, a 
monoclonal antibody specific for the human a3(IV)NCl (C). Frozen sections of human 
25 kidney (D-F) were probed with Mabl7, a monoclonal antibody specific for the 
ot3(IV)NCl domain (D), GPBP-specific antibodies (E), or sera from a GP patient (F). 
Control sera (chicken pre-immune and human control) did not display tissue-binding in 
parallel studies (not shown). 

Figure 14. GPBP is expressed in human but not in bovine and murine renal 
30 cortex. Cortex from human (A, D), bovine (B, E) or murine (C. F) kidney were paraffin 
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embedded and probed with either GPBP-specific antibodies (A-C) or GPBP/GPBPA26- 
specific antibodies (D-F). 

Figure 15. GPBP is highly expressed in several autoimmune conditions. 

Skeletal muscle total RNA from a control individual (lane 1) or from a GP patient (lane 
5 2) was subjected to RT-PCR as in Fig.8, using the oligonucleotides 15m and 62c in the 
amplification program. Frozen (B-D) or paraffin embedded (E-G) human control skin 
(B, E) or skin affected by SLE (C, F) or lichen planus (D, G) were probed with GPBP- 
specific antibodies. 

Figure 16. Phosphorylation of GP alternative splicing products by PKA. In 

10 left panel, equimolecular amounts of rGP (lanes 1), rGPAV (lanes 2), rGPAIII (lanes 3) 
or rGPAIII/IV/V (lanes 4), equivalent to 500 ng of the GP were phosphorylated at the 
indicated ATP concentrations. One-fifth of the total phosphorylation reaction mixture 
was separated by gel electrophoresis and transferred to PVDF, autoradiographed 
(shown) and the proteins blotted with M3/1, a specific monoclonal antibody 
15 recognizing all four species (shown) or using antibodies specific for each individual C- 
terminal region (not shown). Arrowheads indicate the position of each recombinant 
protein, from top to bottom, GP, GPAV and, GPAIII -GPAIII/IV/V which displayed the 
same mobilities. Right panel: purified ot3(IV)NCl domain or hexamer was 
phosphorylated with PKA and 0.1 ^iM ATP in the absence (lanes 1) or in the presence 
20 of 10 nmol of peptides representing the C-terminal region of either GPAIII (lanes 2) or 
GPAIII/IV/V (lanes 3). Where indicated the phosphorylation mixtures of purified 
a3(IV)NCl domain were V8 digested and immunoprecipitated with antibodies specific 
for the N terminus of the human a3(IV)NCl domain (3). Bars and numbers indicate the 
position and sizes (kDa) of the molecular weight markers. 
25 Figure 17. Sequence alignment of GPAIII and MBP. The phosphorylation 

sites for PKA (boxed) and the structural similarity for the sites at Ser 8 and 9 of MBP 
and GPAIII respectively are shown (underlined). The identity (vertical bars) and 
chemical homology (dots) of the corresponding exon II (bent arrow) of both molecular 
species are indicated. The complete sequence of GPAIII from the collagenase cleavage 
30 site (72-residues) is aligned with the 69-N terminal residues of MBP comprising the 
exon I and ten residues of the exon II. 
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Figure 18. Phosphorylation of recombinant MBP proteins by PKA. About 
200 ng of rMBP (lane 1), or Ser to Ala mutants thereof in position 8 (lane 2) or 57 (lane 
3). or rMPBAII (lane 4) or Ser to Ala mutants thereof in position 8 (lane 5) or 57 (lane 
6). were phosphorylated by PKA and 0.1 u.M ATP. The mixtures were subjected to 
5 SDS-PAGE, transferred to PVDF and autoradiography (Phosphorylation) and the 
individual molecular species blotted with monoclonal antibodies against human MBP 
obtained from Roche Molecular Biochemicals (Western). 

Figure 19. Phosphorylation of recombinant MBP proteins by GPBP. About 
200 ng of rMBP (lane 1), or Ser to Ala mutants thereof in positions 8 (lane 2) or 57 
10 ( lane 3 K or rMPBAII (lane 4), or Ser to Ala mutants thereof in positions 8 (lane 5) or 57 
(lane <>>. were subjected to SDS-PAGE, transferred to PVDF, and the area containing 
the proteins visualized with Ponceau and stripped out. The immobilized proteins were 
in situ phosphorylated with rGPBP as described in Materials and Methods, 
autoradiographed (Phosphorylation) and subsequently blotted as in Fig. 18 (Western). 

15 

Figure 20. Regulation of the GPBP by the C terminal region of GPAIII. 

About 2no ng of rGPBP were in vitro phosphorylated with 150 jaM ATP in the absence 
(lane I > or in the presence of 5 nmol of GPAIII-derived peptide synthesized either using 
Boc- I lane 2) or Fmoc- (lane 3) chemistry. The reaction mixtures were subjected to 
20 SDS-PAGE. transferred to PVDF and autoradiographed to asses autophosphorylation, 
and subsequently blotted with anti-FLAG monoclonal antibodies (Sigma) to determine 
the amount of recombinant material present (Western). 

25 Detailed Description of the Invention 

All references cited are herein incorporated by reference in their entirety. 
The abbreviations used herein are: bp, base pair; DTT, dithiothreitol; DMEM, 
Dulbccco's modified Eagle's medium; EDTA, ethylenediamine tetraacetic acid; EGTA, 
ethylene glycol-bis(P-aminoethyl ether) N,N,N\N' -tetraacetic acid; GP, Goodpasture; 
30 rGPANI. rGPAIII/IV/V and rGPAV, recombinant material representing the alternative 
forms of the Goodpasture antigen resulting from splicing out exon III, exon III, IV and 
V or exon V, respectively; GPBP and rGPBP, native and recombinant Goodpasture 
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antigen binding protein; GPBPA26 and rGPBPA26, native and recombinant alternative 
form of the GPBP; GST, glutathione S-transferase; HLA, human lymphocyte antigens; 
HPLC, high performance liquid chromatography; Kb, thousand base pairs; kDa, 
thousand daltons; MBP, rMBP, native and recombinant 21 kDa myelin basic protein; 
5 MBPAII and rMBP All, native and recombinant 18.5 kDa myelin basic protein that 
results from splicing out exon II; MBPAV and MBPAII/V, myelin basic protein 
alternative forms resulting from splicing out exon V and exons II and V, respectively; 
MHC, major histocompatibility complex; NCI, non-collagenous domain; PH, 
pleckstrin homology; PRA, cAMP-dependent protein kinase; PMSF, 
10 phenylmethylsulfonyl fluoride; SDS-PAGE, sodium dodecylsulfate polyacrylamide gel 
electrophoresis; TBS, tris buffered saline. 

Within this application, unless otherwise stated, the techniques utilized may be 
found in any of several well-known references such as: Molecular Cloning: A 
Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), 
15 Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. 
Goeddel, 1991. Academic Press, San Diego, CA), "Guide to Protein Purification" in 
Methods in Enzymology (M.P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR 
Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, 
San Diego, CA), Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed. (R.I. 
20 Freshney. 1987. Liss, Inc. New York, NY), Gene Transfer and Expression Protocols, 
pp. 109-128, ed. E.J. Murray, The Humana Press Inc., Clifton, NJ.), and the Ambion 
1998 Catalog (Ambion, Austin, TX). 

As used herein, the term "GPBP" refers to Goodpasture binding protein, and 
includes both monomers and oligomers thereof. Human (SEQ ID NO:2), mouse (SEQ 
25 ID NO:4), and bovine GPBP sequences (SEQ ID NO:6) are provided herein. 

As used herein, the term "GPBPAie" refers to Goodpasture binding protein 
deleted for the 26 amino acid sequence shown in SEQ ID NO:14 r and includes both 
monomers and oligomers thereof. Human (SEQ ID NO:8), mouse (SEQ ID NO: 10), 
and bovine GPBP sequences (SEQ ID NO: 1 2) are provided herein. 
30 As used herein the term "GPBPpepl" refers to the 26 amino acid peptide 

shown in SEQ ID NO: 14, and includes both monomers and oligomers thereof. 
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As used herein, the term "GP antigen" refers to the a3 NCI domain of type IV 
collagen. 

As used herein, "MBP" refers to myelin basic protein. 

In one aspect, the present invention provides isolated nucleic acids that encode 
5 GPBP, GPBPA26, and GPBPpepl, and mutants or fragments thereof. In one 
embodiment, the isolated nucleic acids comprise sequences substantially similar to SEQ 
ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO: 17, SEQ ID NO:19, SEQ ID NO:21, SEQ 
ID NO:23, or SEQ ID NO:25, or fragments thereof 
10 In another aspect, the present invention provides isolated nucleic acids that encode 

alternative products of the GP antigen or MBP. In one embodiment, the isolated nucleic 
acids comprise sequences that encode peptides substantially similar to SEQ ID NO:43 and 
SEQ ID NO:44. 

The phrase "substantially similar " is used herein in reference to the nucleotide f 

15 sequence of DNA or RNA, or the amino acid sequence of protein, having one or more 
conservative or non-conservative variations from the disclosed sequences, including but 
not limited to deletions, additions, or substitutions, wherein the resulting nucleic acid 
and/or amino acid sequence is functionally equivalent to the sequences disclosed 
herein. Functionally equivalent sequences will function in substantially the same 

20 manner to produce substantially the same protein disclosed herein. For example, 
functionally equivalent DNAs encode proteins that are the same as those disclosed 
herein or that have one or more conservative amino acid variations, such as substitution 
of a non-polar residue for another non-polar residue or a charged residue for a similarly 
charged residue. These changes include those recognized by those of skill in the art as 

25 substitutions that do not substantially alter the tertiary structure of the protein. 

In practice, the term substantially similar means that DNA encoding two 
proteins hybridize to one another under conditions of moderate to high stringency, and 
encode proteins that have either the same sequence of amino acids, or have changes in 
sequence that do not alter their structure or function. As used herein, substantially 

30 similar sequences of nucleotides or amino acids share at least about 70% identity, more 
preferably at least about 80% identity, and most preferably at least about 90% identity. 
It is recognized, however, that proteins (and DNA or mRNA encoding such proteins) 
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containing less than the above-described level of homology arising as splice variants or 
that arc modified by conservative amino acid substitutions (or substitution of 
degenerate codons) are contemplated to be within the scope of the present invention. 

Stringency of hybridization is used herein to refer to conditions under which 
* nucleic acid hybrids are stable. As known to those of skill in the art, the stability of 
hybrids is reflected in the melting temperature (T M ) of the hybrids. T M decreases 
approximately 1-1. 5°C with every 1% decrease in sequence homology. In general, the 
stability of a hybrid is a function of sodium ion concentration and temperature. 
Typically, the hybridization reaction is performed under conditions of lower stringency, 
followed by washes of varying, but higher, stringency. Reference to hybridization 
stringency relates to such washing conditions. Thus, as used herein, moderate 
stringency refers to conditions that permit hybridization of those nucleic acid sequences 
that form stable hybrids in 0.1% SSPE at 37°C or 55°C, while high stringency refers to 
conditions that permit hybridization of those nucleic acid sequences that form stable 
hvbrids in 0. 1 %SSPE at 65°C. It is understood that these conditions may be duplicated 
using a variety of buffers and temperatures and that they are not necessarily precise. 
Denhardfs solution and SSPE (sec, e.g., Sambrook, Fritsch, and Maniatis, in: 
Molecular Cloning, A Laboratory Manual Cold Spring Harbor Laboratory Press, 1989) 
are well known to those of skill in the art, as are other suitable hybridization buffers. 

The isolated nucleic acid sequence may comprise an RNA, a cDNA, or a 
genomic clone with one or more introns. The isolated sequence may further comprise 
additional sequences useful for promoting expression and/or purification of the encoded 
protein, including but not limited to polyA sequences, modified Kozak sequences, and 
sequences encoding epitope tags, export signals, and secretory signals, nuclear 
localization signals, and plasma membrane localization signals. 

In another aspect, the present invention provides recombinant expression vectors 
composing nucleic acid sequences that express GPBP, GPBPA26, or GPBPpepl, and 
mutants or fragments thereof. In one embodiment, the vectors comprise nucleic acid 
sequences that are substantially similar to the sequences shown in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, 
SEU II) NO: 15, SEQ ID NO:17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, or 
SEQ ID NO:25, or fragments thereof. 
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In another aspect, the present invention provides recombinant expression vectors 
comprising nucleic acid sequences that express peptides that are substantially similar to 
the amino acid sequence shown in SEQ ID NO:43, SEQ ID NO:44, or peptide fragments 
thereof. 

"Recombinant expression vector" includes vectors that operatively link a 
nucleic acid coding region or gene to any promoter capable of effecting expression of 
the gene product. The promoter sequence used to drive expression of the disclosed 
nucleic acid sequences in a mammalian system may be constitutive (driven by any of a 
variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or 
inducible (driven by any of a number of inducible promoters including, but not limited 
to. tetracycline, ecdysone, steroid-responsive). The construction of expression vectors 
for use in transfecting prokaryotic cells is also well known in the art, and thus can be 
accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and 
Maniatis. in: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, 1989; Gene Transfer and Expression Protocols, pp. 109-128, ed. E.J. 
Murray. The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog 
( Ambion. Austin, TX) 

The expression vector must be replicable in the host organisms either as an 
cpisome or by integration into host chromosomal DNA. In a preferred embodiment, 
the expression vector comprises a plasmid. However, the invention is intended to 
include other expression vectors that serve equivalent functions, such as viral vectors. 

In a further aspect, the present invention provides host cells that have been 
transfecied with the recombinant expression vectors disclosed herein, wherein the host 
cells can be either prokaryotic or eukaryotic. The cells can be transiently or stably 
transfected. Such transfection of expression vectors into prokaryotic and eukaryotic 
cells can be accomplished via any technique known in the art, including but not limited 
to standard bacterial transformations, calcium phosphate co-precipitation, 
clectroporation, or liposome mediated-, DEAE dextran mediated-, polycationic 
mediated-, or viral mediated transfection. (See, for example, Molecular Cloning: A 
Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press; 
Culture of Animal Cells: A Manual of Basic Technique, 2 nd Ed. (R.I. Freshney. 1987. 
Liss, Inc. New York, NY), 
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In a still further aspect, the present invention provides substantially purified 
GPBP, GPBPA26, and GPBPpepl, and mutants or fragments thereof. In one 
embodiment, the amino acid sequence of the substantially purified protein is substantially 
similar to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, 
5 SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ 
ID NO:22, SEQ ID NO:24, or peptide fragments thereof. 

In another aspect, the present invention provides substantially purified alternative 
products of the GP antigen and MBP. In one embodiment, the amino acid sequence of the 
substantially purified polypeptide is substantially similar to SEQ ID NO:43, SEQ ID 
l o NO:44, or peptide fragments thereof. 

As used herein, the term "substantially purified" means that the protein has been 
separated from its in vivo cellular environments. Thus, the protein can either be 
purified from natural sources, or recombinant protein can be purified from the 
transfected host cells disclosed above. In a preferred embodiment, the proteins are 
1 5 produced by the transfected cells disclosed above, and purified using standard techniques. 
(See for example, Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, 
Cold Spring Harbor Laboratory Press.)) The protein can thus be purified from 
prokaryotic or eukaryotic sources. In various further preferred embodiments, the 
protein is purified from bacterial, yeast, or mammalian cells. 
20 The protein may comprise additional sequences useful for promoting 

purification of the protein, such as epitope tags and transport signals. Examples of such 
epitope tags include, but are not limited to FLAG (Sigma Chemical, St. Louis, MO), 
myc (9E10) (Invitrogen, Carlsbad, CA), 6-His (Invitrogen; Novagen, Madison, WI), 
and HA (Boehringer Manheim Biochemicals). Examples of such transport signals 
25 include, but are not limited to, export signals, secretory signals, nuclear localization 
signals, and plasma membrane localization signals. 

In another aspect, the present invention provides antibodies that selectively bind to 
GPBP, GPBPA26, or GPBPpepl. In one aspect, the antibodies selectively bind to a 
protein comprising a sequence selected from the group consisting of SEQ ID NO:2, SEQ 
30 ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO: 14, 
SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, or 
peptide fragments thereof. Such antibodies can be produced by immunization of a host 
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animal with either the complete GPBP, or with antigenic peptides thereof. The antibodies 
can be either polyclonal or monoclonal. 

In another aspect, the present invention provides antibodies that selectively bind to 
a polypeptide comprising an amino acid sequence substantially similar to a sequence 

5 selected from the group consisting of SEQ ID NO:43, SEQ ID NO:44, SEQ ED NO: 46, 
SEQ ID NO: 48, SEQ ID NO:50, SEQ ID NO:54, or antigenic fragments thereof. The 
antibodies can be either polyclonal or monoclonal. 

Antibodies can be made by well-known methods, such as described in Harlow 
and Lane, Antibodies; A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 

10 Spring Harbor, N.Y., (1988). In one example, preimmune serum is collected prior to 
the first immunization. Substantially purified proteins of the invention, or antigenic 
fragments thereof, together with an appropriate adjuvant, is injected into an animal in 
an amount and at intervals sufficient to elicit an immune response. Animals are bled at 
regular intervals, preferably weekly, to determine antibody titer. The animals may or w 

15 may not receive booster injections following the initial immunization. At about 7 days - 
after each booster immunization, or about weekly after a single immunization, the 
animals are bled, the serum collected, and aliquots are stored at about -20° C. 
Polyclonal antibodies against the proteins and peptides of the invention can then be 
purified directly by passing serum collected from the animal through a column to which 

20 non-antigen-related proteins prepared from the same expression system without GPBP- 
relatcd proteins bound. 

Monoclonal antibodies can be produced by obtaining spleen cells from the 
animal. (See Kohler and Milstein, Nature 256, 495-497 (1975)). In one example, 
monoclonal antibodies (mAb) of interest are prepared by immunizing inbred mice with 

25 the proteins or peptides of the invention, or an antigenic fragment thereof. The mice 
are immunized by the IP or SC route in an amount and at intervals sufficient to elicit an 
immune response. The mice receive an initial immunization on day 0 and are rested for 
about 3 to about 30 weeks. Immunized mice are given one or more booster 
immunizations of by the intravenous (IV) route. Lymphocytes, from antibody positive 

30 mice are obtained by removing spleens from immunized mice by standard procedures 
known in the art. Hybridoma cells are produced by mixing the splenic lymphocytes 
with an appropriate fusion partner under conditions which will allow the formation of 
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stable hybridomas. The antibody producing cells and fusion partner cells are fused in 
polyethylene glycol at concentrations from about 30% to about 50%. Fused hybridoma 
cells are selected by growth in hypoxanthine, thymidine and aminopterin supplemented 
Dulbecco's Modified Eagles • Medium (DMEM) by procedures known in the art. 
Supernatant fluids are collected from growth positive wells and are screened for 
antibody production by an immunoassay such as solid phase immunoradioassay. 
Hybridoma cells from antibody positive wells are cloned by a technique such as the soft 
agar technique of MacPherson, Soft Agar Techniques, in Tissue Culture Methods and 
Applications, Kruse and Paterson, Eds., Academic Press, 1973. 

To generate such an antibody response, the proteins of the present invention are 
typically formulated with a pharmaceutically acceptable carrier for parenteral 
administration. Such acceptable adjuvants include, but are not limited to, Freund's 
complete, Freund's incomplete, alum-precipitate, water in oil emulsion containing 
Corvncbacterium parvum and tRNA. The formulation of such compositions, including 
the concentration of the polypeptide and the selection of the vehicle and other 
components, is within the skill of the art. 

The term antibody as used herein is intended to include antibody fragments 
thereof which are selectively reactive with the proteins and peptides of the invention, or 
fragments thereof. Antibodies can be fragmented using conventional techniques, and 
the fragments screened for utility in the same manner as described above for whole 
antibodies. For example, F(ab')2 fragments can be generated by treating antibody with 
pepsin. The resulting F(ab') 2 fragment can be treated to reduce disulfide bridges to 
produce Fab' fragments. 

In a further aspect, the invention provides methods for detecting the presence of 
the proteins or peptides of the invention in a protein sample, comprising providing a 
protein sample to be screened, contacting the protein sample to be screened with an 
antibody against the proteins or peptides of the invention, and detecting the formation 
of antibody-antigen complexes. The antibody can be either polyclonal or monoclonal 
as described above, although monoclonal antibodies are preferred. As used herein, the 
term "protein sample" refers to any sample that may contain the proteins or peptides of 
the invention, and fragments thereof, including but not limited to tissues and portions 
thereof, tissue sections, intact cells, cell extracts, purified or partially purified protein 
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samples, bodily fluids, nucleic acid expression libraries. Accordingly, this aspect of 
the present invention may be used to test for the presence of GPBP, GPBPA26, 
GPBPpepl, or alternative products of the GP antigen in these various protein samples 
by standard techniques including, but not limited to, immunolocalization, 
5 immunofluorescence analysis, Western blot, analysis, ELISAs, and nucleic acid 
expression library screening, (See for example, Sambrook et al, 1989.) In one 
embodiment, the techniques may determine only the presence or absence of the protein 
or peptide of interest. Alternatively, the techniques may be quantitative, and provide 
information about the relative amount of the protein or peptide of interest in the sample. 

10 For quantitative purposes, ELISAs are preferred. 

Detection of immunocomplex formation between the proteins or peptides of the 
invention, or fragments thereof, and their antibodies or fragments thereof, can be 
accomplished by standard detection techniques. For example, detection of . 
immunocomplexes can be accomplished by using labeled antibodies or secondary sr 

15 antibodies. Such methods, including the choice of label are known to those ordinarily : 
skilled in the art. (Harlow and Lane, Supra). Alternatively, the polyclonal or 
monoclonal antibodies can be coupled to a detectable substance. The term "coupled" is 
used to mean that the detectable substance is physically linked to the antibody. 
Suitable detectable substances include various enzymes, prosthetic groups, fluorescent 

20 materials, luminescent materials and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, P-galactosidase, or 
acetylcholinesterase. Examples of suitable prosthetic-group complexes include 
streptavidin/biotin and avidin/biotin. Examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 

25 dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin. An example of a 
luminescent material includes luminoL Examples of suitable radioactive material 
include l25 I, 13I I, 35 S or 3 H. 

Such methods of detection are useful for a variety of purposes, including but not 
limited to detecting an autoimmune condition, identifying cells targeted for or 

30 undergoing apoptosis, immunolocalization of the proteins of interest in a tissue sample, 
Western blot analysis, and screening of expression libraries to find related proteins. 
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In yet another aspect, the invention provides methods for detecting the presence in 
a sample of nucleic acid sequences encoding the GPBP, GPBPA26, GPBPpepl, or 
alternative products of the GP antigen comprising providing a nucleic acid sample to be 
screened, contacting the sample with a nucleic acid probe derived from the isolated 
5 nucleic acid sequences of the invention, or fragments thereof, and detecting complex 
formation. 

As used herein, the term "sample" refers to any sample that may contain GPBP- 
related nucleic acid, including but not limited to tissues and portions thereof, tissue 
sections, intact cells, cell extracts, purified or partially purified nucleic acid samples, 
10 DNA libraries, and bodily fluids. Accordingly, this aspect of the present invention may 
be used to test for the presence of GPBP mRNA or DNA in these various samples by 
standard techniques including, but not limited to, in situ hybridization, Northern 
blotting, Southern blotting, DNA library screening, polymerase chain reaction (PCR) or 
reverse transcription-PCR (RT-PCR). (See for example, Sambrook et al, 1989.) In one 
15 embodiment, the techniques may determine only the presence or absence of the nucleic 
acid of interest. Alternatively, the techniques may be quantitative, and provide 
information about the relative amount of the nucleic acid of interest in the sample. For 
quantitative purposes, quantitative PCR and RT-PCR are preferred. Thus, in one 
example, RNA is isolated from a sample, and contacted with an oligonucleotide derived 
20 from the nucleic acid sequence of interest, together with reverse transcriptase under 
suitable buffer and temperature conditions to produce cDNAs from the GPBP-related 
RNA. The cDNA is then subjected to PCR using primer pairs derived from the nucleic 
acid sequence of interest. In a preferred embodiment, the primers are designed to detect 
the presence of the RNA expression product of SEQ ID NO:5, and the amount of GPBP 
25 gene expression in the sample is compared to the level in a control sample. 

For detecting the nucleic acid sequence of interest, standard labeling techniques 
can be used to label the probe, the nucleic acid of interest, or the complex between the 
probe and the nucleic acid of interest, including, but not limited to radio-, enzyme-, 
chemiluminescent-, or avidin or biotin-labeling techniques, all of which are well known 
30 in the art. (See, for example, Molecular Cloning: A Laboratory Manual (Sambrook, et 
al., 1989, Cold Spring Harbor Laboratory Press), Gene Expression Technology 
(Methods in Enzymoiogy, Vol. 185, edited by D. Goeddel, 1991. Academic Press, San 
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Diego, CA); PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1990. 
Academic Press, San Diego, CA)). 

Such methods of nucleic acid detection are useful for a variety of purposes, 
including but not limited to diagnosing an autoimmune condition, identifying cells 
targeted for or undergoing apoptosis, in situ hybridization, Northern and Southern blot 
analysis, and DNA library screening. 

As demonstrated in the following examples, GPBP shows preferential 
expression in tissue structures that are commonly targeted in naturally-occurring 
automimmune responses, and is highly expressed in several autoimmune conditions, 
including but not limited to Goodpasture Syndrome (GP), systemic lupus 
erythematosus (SLE), and lichen planus. Furthermore, following a similar 
experimental approach to that described below, recombinant proteins representing 
autoantigens in GP disease (a3 Type IV collagen), SLE (PI ribosomal phosphoprotein 
and Sm-Dl small nuclear ribonucleoproteins) and dermatomyositis (hystididyl-tRNA . 
synthetase) were shown to be in vitro substrates of GPBP. 

Thus, in a preferred embodiment, detection of GPBP expression is used to 
detect an autoimmune condition. A sample that is being tested is compared to a control 
sample for the expression of GPBP, wherein an increased level of GPBP expression 
indicates the presence of an autoimmune condition. In this embodiment, it is preferable 
to use antibodies that selectively bind to GPBPpepl, which is present in GPBP but not 
in GPBPA26. 

Furthermore, as shown in the accompanying examples, GPBP is down- 
regulated in tumor cell lines, and the data suggest that GPBP/GPBPA26 are likely to be 
involved in cell signaling pathways that induce apoptosis, which may be up-regulated 
during autoimmune pathogenesis and down-regulated during cell transformation to 
prevent autoimmune attack to transformed cells during tumor growth. Thus, the 
detection methods disclosed herein can be used to detect cells that are targeted for, or 
are undergoing apoptosis. 

In another aspect, the present invention provides a method for treating an 
autoimmune disorder, a tumor, or for preventing cell apoptosis comprising modification 
of the expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 
substantially similarly to GPBPpepl in a patient in need thereof. Modifying the 
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expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 
substantially similarly to GPBPpepl can be accomplished by using specific inducers or 
inhibitors of GPBP expression or activity, GPBP antibodies, gene or protein therapy 
using GP or myelin basic protein alternative products, cell therapy using host cells 

5 expressing GP or myelin basic protein alternative products, antisense therapy, or other 
techniques known in the art. In a preferred embodiment, the method further comprises 
administering a substantially purified alternative product of the GP antigen or MBP to 
modify the expression or activity of GPBP, GPBPA26, or a protein comprising a 
polypeptide substantially similarly to GPBPpepl. As used herein, "modification of 

10 expression or activity" refers to modifying expression or activity of either the RNA or 
protein product. 

In a further aspect," the present invention provides pharmaceutical compositions, 
comprising an amount effective of substantially purified alternative products of the GP 
antigen or MBP to modify the expression or activity of GPBP RNA or protein, and a 

1 5 pharmaceutical^ acceptable carrier. 

For administration, the active agent is ordinarily combined with one or more 
adjuvants appropriate for the indicated route of administration. The compounds may be 
mixed with lactose, sucrose, starch powder, cellulose esters of alkanoic acids, stearic 
acid, talc, magnesium stearate, magnesium oxide, sodium and calcium salts of 

20 phosphoric and sulphuric acids, acacia, gelatin, sodium alginate, polyvinylpyrrolidine, 
and/or polyvinyl alcohol, and tableted or encapsulated for conventional administration. 
Alternatively, the compounds of this invention may be dissolved in saline, water, 
polyethylene glycol, propylene glycol, carboxymethyl cellulose colloidal solutions, 
ethanol, corn oil, peanut oil, cottonseed oil, sesame oil, tragacanth gum, and/or various 

25 buffers. Other adjuvants and modes of administration are well known in the 
pharmaceutical art. The carrier or diluent may include time delay material, such as 
glyceryl monostearate or glyceryl distearate alone or with a wax, or other materials well 
known in the art. 

The present invention may be better understood with reference to the 
30 accompanying examples that are intended for purposes of illustration only and should 
not be construed to limit the scope of the invention, as defined by the claims appended 
hereto. 
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Example 1 : Characterization of GPBP 

Here we report the cloning and characterization of a novel type of serine/threonine 
kinase that specifically binds to and phosphorylates the unique N-terminal region of the 
human GP antigen. 

MATERIALS AND METHODS 

Synthetic polymers-Peptides. GPpepl, KGKRGDSGSPATWTTRGFVFT (SEQ 
ID NO:26), representing residues 3-23 of the human GP antigen and GPpepl Ala 9 , 
KGKRGDAGSPATWTTRGFVFT (SEQ ID NO:27), a mutant Ser 9 to Ala 9 thereof, were 
synthesized by MedProbe and CHIRON. FLAG peptide, was from Sigma. 

Oligonucleotides. The following as well as several other GPBP-specific 
oligonucleotides were synthesized by Genosys and GIBCO BRL: 
ON-GPBP-54m: 

TCGAATTCACCATGGCCCCACTAGCCGACTACAAGGACGACGATG 

ACAAG (SEQ ID NO: 28). 

ON-GPBP-55c: 

CCGAGCCCGACGAGTTCCAGCTCTGATTATCCGACATCTTGTCATCG 
TCG (SEQ IDNO:29). 

ON-HNC-B-N-14m: CGGGATCCGCTAGCTAAGCCAGGCAAGGATGG (SEQ ID 
NO:30). 

ON-HNC-B-N-16c: CGGGATCCATGCATAAATAGCAGTTCTGCTGT (SEQ ID 
NO:31). 

Isolation and characterization of cDNA clones encoding human GPBP- 

Several human X-gtl 1 cDNA expression libraries (eye, fetal and adult lung, kidney and 
HeLa S3, from CLONTECH) were probed for cDNAs encoding proteins interacting 
with GPpepl. Nitrocellulose filters (Millipore) prepared following standard 
immunoscreening procedures were blocked and incubated with 1-10 nmoles per ml of 
GPpepl at 37°C. Specifically bound GPpepl was detected using M3/1A monoclonal 
antibodies (7). A single clone was identified in the HeLa-derived library (HeLal). 
Specificity of fusion protein binding was confirmed by similar binding to recombinant 
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eukaryotic human GP antigen. The EcoRl cDNA insert of HeLal (0.5-kb) was used to 
further screen the same library and to isolate overlapping cDNAs. The largest cDNA 
(2.4-kb) containing the entire cDNA of HeLal (n4') was fully sequenced. 

Northern and Southern blots-Pre-made Northern and Southern blots 
5 '(CLONTECH) were probed with HeLal cDNA following manufacturer instructions. 

Plasmid construction, expression and purification of recombinant proteins- 
GPBP-derived material. The original X-gtll HeLal clone was expressed as a lysogen 
in E. Coli Y1089 (8). The corresponding P~galactosidase-derived fusion protein 
containing the N-terminal 150 residues of GPBP was purified from the cell lysate using 
10 an APTG-agarose column (Boehringer). The EcoRI 2.4-kb fragment of n4' was 
subcloned in Blucscribe Ml 3+ vector (Stratagene) (BS-n4'), amplified and used for 
subsequent cloning. A DNA fragment containing (from 5 1 to 3'), an EcoRI restriction 
site, a standard Kozak consensus for translation initiation, a region coding for a tag 
peptide sequence (FLAG, DYKDDDDK (SEQ ID NO:32)), and the sequence coding 
15 for the first eleven residues of GPBP including the predicted Metj and a Ban II 
restriction site, was obtained by hybridizing ON-GPBP-54m and ON-GPBP-55c, and 
extending with modified T 7 DNA polymerase (Amersham). The resulting DNA product 
was digested with EcoRI and Banll, and ligated with the Banll/EcoRI cDNA fragment 
of BS-n4^ in the EcoRI site of pHIL-D2 (Invitrogen) to produce pHIL-FLAG -n4\ This 
20 plasmid was used to obtain Mut s transformants of the GS1 15 strain of Pichia pastoris 
and to express FLAG-tagged recombinant GPBP (rGPBP) either by conventional liquid 
culture or by fermentation procedures {Pichia Expression Kit, Invitrogen). The cell 
lysates were loaded onto an anti-FLAG M2 column (Sigma), the unbound material 
washed out with Tris buffered saline (TBS, 50 mM Tris-HCl, pH 7.4, 150 mM NaCl) 
25 or salt-supplemented TBS (up to 2M NaCl), and the recombinant material eluted with 
FLAG peptide. For expression in cultured human kidney-derived 293 cells (ATCC 
1573-CRL), the 2.4- or 2.0-kb EcoRI cDNA insert of either BS-n4' or pHIL-FLAG-n4' 
was subcloned in pcDNA3 (Invitrogen) to produce pc-n4' and pc-FLAG-n4' 
respectively. When used for transient expression, 18 hours after transfection the cells 
30 were lysed with 3.5-4 ^il/cm 2 of chilled lysis buffer (1% Nonidet P-40 or Triton-XlOO, 
5mM EDTA and 1 mM PMSF in TBS) with or without 0.1% SDS, depending on 
whether the lysate was to be used for SDS-PAGE or FL AG-purification, respectively. 
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For FLAG purification, the lysate of four to six 175 cm 2 culture dishes was diluted up 
to 50 ml with lysis buffer and purified as above. For stable expression, the cells were 
similarly transfected with pc-n4' and selected for three weeks with 800 ng/ml of G418. 
For bacterial recombinant expression, the 2.0-kb EcoRI cDNA fragment of pHIL- 
FLAG-n-T was cloned in-frame downstream of the glutathione ^-transferase (GST)- 
encoding cDNA of pGEX-5x-l (Pharmacia). The resulting construct was used to 
express GST-GPBP fusion protein in DH5a cells (9). 

CP antigen-derived material. Human recombinant GP antigen (rGP) was 
produced in 293 cells using the pRc/CMV-BM40 expression vector containing the a3- 
specific cDNA between ON-HNC-B-N-14m and ON-HNC-B-N-16c. The expression 
vector is a pRc/CMV (Invitrogen)-dcrivcd vector provided by Billy G. Hudson (Kansas 
University Medical Center) that contains cDNA encoding an initiation Met, a BM40 
signal peptide followed by a tag peptide sequence (FLAG), and a polylinker cloning 
site. To obtain a3-specific cDNA, a polymerase chain reaction was performed using 
the oligonucleotides above and a plasmid containing the previously reported a3(IV) 
cDNA sequence (3) as template (clone C2). For stable expression of rGP, 293 cells 
were transfected with the resulting construct (Ta3VLC) and selected with 400 jig/ml of 
G41 S. The harvested rGP was purified using an anti-FLAG M2 column. 

All the constructs were verified by restriction mapping and nucleotide 
sequencing. 

Cell culture and DNA transfection-Human 293 cells were grown in 
Dulbccco's modified Eagle's medium (DMEM) supplemented with 10% fetal calf 
scrum. Trunsfections were performed using the calcium phosphate precipitation method 
of the Pro lection Mammalian Trans fection Systems (Promega). Stably transfected cells 
were selected by their resistance to G41S. Foci of surviving cells were isolated, cloned 
and amplified. 

Antibody production-Po/yc/o/7a/ antibodies against the N-terminal region of 
CiPIif*. Cells expressing HeLal X-gtll as a lysogen were lysed by sonication in the 
presence of .Laemmli sample buffer and subjected to electrophoresis in a 7.5% 
acrylanmle preparative gel. The gel was stained with Coomassie blue and the band 
containing the fusion protein of interest excised and used for rabbit immunization (10). 
The anti-serum was tested for reactivity using APTG-affinity purified antigen. To 
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obtain affinity-purified antibodies, the anti-serum was diluted 1:5 with TBS and loaded 
onto a Sepharose 4B column containing covalently bound affinity purified antigen. The 
bound material was eluted and, unless otherwise indicated, used in the 
immunochemical studies. 

Monoclonal antibodies against GPBP.. Monoclonal antibodies were produced 
essentially as previously reported (7) using GST-GPBP. The supematants of individual 
clones were analyzed for antibodies against rGPBP. 

In vitro phosphorylation assays-About 200 ng of rGPBP were incubated 
overnight at 30°C in 25 mM p-glycerolphosphate (pH 7.0), 0.5 mM EDTA, 0.5 mM 
EGTA, 8 mM MgCl 2 , 5 mM MnCl 2 , 1 mM DTT and 0.132 \xM y- 32 P-ATP, in the 
presence or absence of 0.5-1 |ug of protein substrates or 10 nmoles of synthetic 
peptides, in a total volume of 50 

In vivo phosphorylation assays-Individual wells of a 24-well dish were seeded 
with normal or with stably pc-n4' transfected 293 cells. When the cells were grown to 
the desired density, a number of wells of the normal 293 cells were transfected with pc- 
FLAG-n4\ After 12 hours, the culture medium was removed, 20 |aCi/well of H3 32 P04 
in 100 |al of phosphate- free DMEM added, and incubation continued for 4 hours. The 
cells were lysed with 300 |al/well of TBS containing 1% Triton X-100, 2 mM EDTA, 1 
mM PMSF, 50 mM NaF and 0.2 mM vanadate, and extracted with specific antibodies 
and Protein A-Sepharose. When anti-GPBP serum was used, the lysate was pre-cleared 
using pre-immune serum and Protein A-Sepharose. 

In vitro dephosphorylation of rGPBP-About 1 \xg of rGPBP was 
dephosphorylated in 100 ^1 of 10 mM Tris-acetate (pH 7.5), 10 mM magnesium acetate 
and 50 mM potassium acetate with 0.85 U of calf intestine alkaline phosphatase 
(Pharmacia) for 30 min at 30°C. 

Renaturation assays-In-blot renaturation assays were performed using 1-5 jag 
of rGPBP as previously described (11). 

Nucleotide sequence analysis- cDNA sequence analyses were performed by 
the dideoxy chain termination method using [a] 35 S-dATP, modified T 7 DNA 
polymerase (Amersham) and universal or GPBP-specific primers (8-10). 
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32 P-Phosphoamino acid analysis-Immunopurified rGPBP or HPLC gel- 
filtration fractions thereof containing the material of interest were phosphorylated, 
hydrolyzed and analyzed in one dimensional (4) or two dimensional thin layer 
chromatography (12). When performing two dimensional analysis, the buffer for the 
first dimension was formic acidiacetic acid:water (1:3.1:35.9) (pH 1.9) and the buffer 
for the second dimension was acetic acid rpyridine: water (2:0.2:37.8) (pH 3.5). Amino 
acids were revealed with ninhydrin, and 32 P-phosphoamino acids by autoradiography. 

Physical methods and immunochemical techniques-SDS-PAGE and 
Western-blotting were performed as in (4). Immunohistochemistry studies were done 
on human multi-tissue control slides (Biomeda, Biogenex) using the ABC peroxidase 
method (13). 

Computer analysis-Homology searches were carried out against the GenBank 
and SwissProt databases with the BLAST 2.0 (14) at the NCBI server, and against the 
TIGR Human Gene Index database for expressed sequence tags, using the Institute for 
Genomic Research server. The search for functional patterns and profiles was 
performed against the PROSITE database using the ProfileScan program at the Swiss 
Institute of Bioinformatics (15). Prediction of coiled-coil structures was done at the 
Swiss Institute for Experimental Cancer Research using the program Coils (16) with 
both 2 1 and 28 residue windows. 



RKSULTS 

Molecular cloning of GPBP-To search for proteins specifically interacting with 
the divergent N-terminal region of the human GP antigen, a 21 -residue peptide (GPpepl; 
SEQ ID NO:26)), encompassing this region and flanking sequences, and specific 
monoclonal antibodies against it were combined to screen several human cDNA 
expression libraries. More than 5 x 10 6 phages were screened to identify a single HeLa- 
derived recombinant encoding a fusion protein specifically interacting with GPpepl 
without disturbing antibody binding. 

Using the cDNA insert of the original clone (HeLal), we isolated a 2.4-kb cDNA 
(n4') that contains 408-bp of 5' -untranslated sequence, an open reading frame (ORF) of 
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1872-bp encoding 624 residues, and 109-bp of 3' -untranslated sequence (Fig. 1) (SEQ ID 
NO: 1-2). Other structural features are of interest. First, the predicted polypeptide 
(hereinafter referred to as GPBP) has a large number of phosphorylatable (17.9%) and 
acidic (16%) residues unequally distributed along the sequence. Serine, which is the most 
5 abundant residue (9.3%), shows preference for two short regions of the protein, where it 
comprises nearly 40% of the amino acids, compared to an average of less than 7% 
throughout the rest of the polypeptide chain. It is also noteworthy that the more N- 
terminal, serine-rich region consists mainly of a Ser-Xaa-Yaa repeat. Acidic residues are 
preferentially located at the N-terminal three-quarters of the polypeptide, with nearly 1 8% 
10 of the residues being acidic. These residues represent only 9% in the most C-terminal 
quarter of the polypeptide, resulting in a polypeptide chain with two electrically opposite 
domains. At the N-terminus, the polypeptide contains a pleckstrin homology (PH) 
domain, which has been implicated in the recruitment of many signaling proteins to the 
cell membrane where they exert their biological activities (17). Finally, a bipartite nuclear 
15 targeting sequence (18) exists as an integral part of a heptad repeat region that meets all 
the structural requirements to form a coiled-coil (16). 

Protein data bank searches revealed homologies almost exclusively within the 
approximately 100 residues at the N-terminal region harboring the PH domain. The PH 
domain of the oxysterol-binding protein is the most similar, with an overall identity of 
20 33.5% and a similarity of 65.2% with GPBP. In addition, the Caenorhabditis elegans 
cosmid F25H2 (accession number Q93569) contains a hypothetical ORF that displays an 
overall identity of 26.5% and a similarity of 61% throughout the entire protein sequence, 
indicating that similar proteins are present in lower invertebrates. Several human 
expressed sequence tags (accession numbers AA287878, AA287561, AA307431, 
25 AA331618, AA040134, AA158618, AA040087, AA122226, AA158617, AA121104, 
AA4 12432, AA412433, AA282679 and N27578) possess a high degree of nucleotide 
identity (above 98%) with the corresponding stretches of the GPBP cDNA, suggesting 
that they represent human GPBP. Interestingly, the AA287878 EST shows a gap of 67 
nucleotides within the sequence corresponding to the GPBP 5' -untranslated region, 
30 suggesting that the GPBP pre-mRNA is alternatively spliced in human tissues (not 
shown). 
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The distribution and expression of the GPBP gene in human tissues was first 
assessed by Northern blot analysis (Fig. 2, panel A). The gene is expressed as two major 
mRNAs species between 4.4-kb and 7.5-kb in length and other minor species of shorter 
lengths. The structural relationship between these multiple mRNA species is not known 
and their relative expression varies between tissues. The highest expression level is seen in 
striated muscle (skeletal and heart), while lung and liver show the lowest expression 
levels. 

Southern blot studies analysis of genomic DNA from different species indicated 
that homologous genes exist throughout phylogeny (Fig. 2, panel B). Consistent with the 
human origin of the probe, the hybridization intensities decreased in a progressive fashion 
as the origin of the genomic DNA moves away from humans in evolution. 

Experimental determination of the translation start site-To experimentally 
confirm the predicted ORF, eukaryotic expression vectors containing either the 2.4-kb of 
cDNA of n4\ or only the predicted ORF tagged with a FLAG sequence (Fig. 3 A), were 
used for transient expression assays in 293 cells. The corresponding extracts were 
analyzed by immunoblot using GPBP- or FLAG-specific antibodies. The GPBP-specific 
antibodies bind to a similar major polypeptide in both transfected cells, but only the 
polypeptide produced by the engineered construct expressed the FLAG sequence (Fig. 
3B). This located the translation start site of the n4' cDNA at the predicted Met and 
confirmed the proposed primary structure. Furthermore, the recombinant polypeptides 
displayed a molecular mass higher than expected (80 versus 71 kDa) suggesting that 
GPBP undergoes post-translational modifications. 

Expression and characterization of yeast rGPBP- Yeast expression and FLAG- 
based affinity-purification were combined to produce rGPBP (Fig. 4A). A major 
polypeptide of -89 kDa, along with multiple related products displaying lower M r , were 
obtained. The recombinant material was recognized by both anti-FLAG and GPBP- 
specific antibodies, guaranteeing the fidelity of the expression system. Again, however, 
the M v displayed by the major product was notably higher than predicted and even higher 
than the M r of the 293 cell-derived recombinant material, supporting the idea that GPBP 
undergoes important and differential post-translational modifications. Since 
phosphorylatable residues are abundant in the polypeptide chain, we investigated the 
existence of phosphoamino acids in the recombinant materials. By using monoclonal or 
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polyclonal (not shown) antibodies against phosphoserine (Pser), phosphothreonine (PThr) 
and phosphotyrosine (PTyr), we identified the presence of all three phosphoresidues either 
in yeast rGPBP (Fig. 4B) or in 293 cell-derived material (not shown). The specificity of 
the antibodies was further assessed by partially inhibiting their binding by the addition of 
5-10 mM of the corresponding phosphoamino acid (not shown). This suggests that the 
phosphoresidue content varies depending upon the cell expression system, and that the M r 
differences are mainly due to phosphorylation. Dephosphorylated yeast-derived material 
consistently displayed similar M T to the material derived from 293 cells, and 
phosphoamino acid content correlates with SDS-PAGE mobilities (Fig.4C). As an in vivo 
measurement, the phosphorylation of rGPBP in the 293 cells was assessed (Fig. 4D). 
Control cells (lanes 1) and cells expressing rGPBP in a stable (lanes 2) or transient (lanes 
3) mode were cultured in the presence of H3 32 PC>4. Immunoprecipitated recombinant 
material contained 32 P, indicating that phosphorylation of GPBP occurred in vivo and 
therefore is likely to be a physiological process. 

The rGPBP is a serine/threonine kinase that phosphorylates the N-terminal 
region of the human GP antigen -Although GPBP does not contain the conserved 
structural regions required to define the classic catalytic domain for a protein kinase, the 
receni identification and characterization of novel non-conventional protein kinases (19- 
27) encouraged the investigation of its phosphorylating activity. Addition of [y 32 P]ATP to 
rGPBP (either from yeast or 293 cells (not shown)) in the presence of Mn 2+ and Mg 2+ 
resulted in the incorporation of 32 P as PSer and PThr in the major and related products 
recognized by both anti-FLAG and specific antibodies (Fig. 5A and B), indicating that the 
affinitv-purified material contains a Ser/Thr protein kinase. To further characterize this 
activity, GPpepl, GPpeplAla 9 (a GPpepl mutant with Ser 9 replaced by Ala), native and 
recombinant human GP antigens, and native bovine GP antigen were assayed (Fig. 5C). 
Affinity-purified rGPBP phosphorylates all human-derived material to a different extent. 
However, in similar conditions, no appreciable 32 P-incorporation was observed in the 
bovine-derived substrate. The lower 32 P incorporation displayed by GPpeplAla 9 when 
compared with GPpepl, and the lack of phosphorylation of the bovine antigen, indicates 
that the kinase present in rGPBP discriminates between human and bovine antigens, and 
that Ser } is a target for the kinase. 
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Although the purification system provides high quality material, the presence of 
contaminants with a protein kinase activity could not be ruled out. The existence of 
contaminants was also suggested by the presence of a FLAG-containing 40 kDa 
polypeptide, which displayed no reactivity with specific antibodies nor incorporation of 
32 P in the phosphorylation assays (Fig. 4A and 5A). To precisely identify the polypeptide 
harboring the protein kinase activity, we performed in vitro kinase renaturation assays 
after SDS-PAGE and Westem-blotted (Fig. 6). We successfully combined the use of 
specific antibodies (lane 1) and autoradiographic detection of in situ 32 P-incorporation 
(lane 2), and identified the 89 kDa rGPBP material as the primary polypeptide harboring 
the Ser/Thr kinase activity. The lack of 32 P- incorporation in the rGPBP-derived products, 
as well as in the 40 kDa contaminant, further supports the specificity of the renaturation 
assays and locates the kinase activity to the 89 kDa polypeptide. Recently, it has been 
shown that traces of protein kinases intimately associated with a polypeptide can be 
released from the blot membrane, bind to, and phosphorylate the polypeptide during the 
labeling step (28). To assess this possibility in our system, we performed renaturation 
studies using a small piece of membrane containing the 89 kDa polypeptide, either alone 
or together with membrane pieces representing the different regions of the blot lane. We 
observed similar 32 P-incorporation at the 89 kDa polypeptide regardless of the co- 
incubated pieces (not shown), indicating that if there are co-purified protein kinases in our 
sample they are not phosphorylating the 89 kDa polypeptide in the renaturation assays 
unless they co-migrate. Co-migration does not appear to be a concern, however, since 
rGPBP deletion mutants (GPBPA26 and R3; see below) displaying different mobilities 
also have kinase activities and could be similarly in-blot renatured (not shown). 

Immunohistochemical localization of the novel kinase-To investigate GPBP 
expression in human tissues we performed immunohistochemical studies using specific 
polyclonal (Fig.7) or monoclonal antibodies (not shown). Although GPBP is widely 
expressed in human tissues, it shows tissue and cell-specificity. In kidney, the major 
expression is found at the tubule epithelial cells and the glomerular mesangial cells and 
podocytes. At the lung alveolus, the antibodies display a linear pattern suggestive of a 
basement membrane localization, along with staining of pneumocytes. Liver shows low 
expression in the parenchyma, but high expression in biliary ducts. Expression in the 
central nervous system is observed in the white matter, but not in the neurons of the brain. 
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In testis, a high expression in the spermatogonium contrasts with the lack of expression in 
Sertoli cells. The adrenal gland shows a higher level of expression in cortical cells versus 
the medullar. In the pancreas, GPBP is preferentially expressed in Langerhans islets 
versus the exocrine moiety. In prostate, GPBP is expressed in the epithelial cells but not 

5 in the stroma (Fig. 7). Other locations with high expression of GPBP are striated muscle, 
epithelial cells of intestinal tract, and Purkinje cells of the cerebellum (not shown). In 
general, in tissues where GPBP is highly expressed the staining pattern is mainly diffuse 
cytosolic. However in certain locations there is, in addition, an important staining 
reinforcement at the nucleus (spermatogonium), at the plasma membrane (pneumocyte, 

1 0 hepatocyte, prostate epithelial cells, white matter) or at the extracellular matrix (alveolus) 
(Fig. 7). 

DISCUSSION 

Our data show that GPBP is a novel, non-conventional serine/threonine kinase. 
15 We also present evidence that GPBP discriminates between human and bovine GP 
antigens, and targets the phosphorylatable region of human GP antigen in vitro. Several 
lines of evidence indicate that the 89 kDa polypeptide is the only kinase in the affinity 
purified rGPBP. First, we found no differences in auto- or trans-phosphorylation among 
rGPBP samples purified in the presence of 150 mM, 0.5 M, 1 M or 2 M salt (not shown), 
20 suggesting that rGPBP does not carry intimately bound kinases. Second, there is no 
FLAG-containing, yeast-derived kinase in our samples, since material purified using 
GPBP-specific antibodies shows no differences in phosphorylation (not shown). Third, a 
deletion mutant (GPBPA26; see below) displays reduced auto- and trans-phosphorylation 
activities (not shown), demonstrating that the 89 kD polypeptide is the only portion of the 
25 rGPBP with the ability to carry out phosphate transfer. 

Although GPBP is not homologous to other non-conventional kinases, they share 
some structural features including an N-terminal a-helix coiled-coil (26, 27), serine-rich 
motifs (24), high phosphoamino acids content (27), bipartite nuclear localization signal 
(27), and the.absence of a typical nucleotide or ATP binding motif (24, 27). 
30 Immunohistochemistry studies show that GPBP is a cytosolic polypeptide also 

found in the nucleus, associated with the plasma membrane and likely at the extracellular 
matrix associated with the basement membrane, indicating that it contains the structural 



WO 00/50607 



31 



PCT/IBOO/00324 



requirements to reach all these destinations. The nuclear localization signal and the PH 
domain confer to it the potential to reach the nucleus and the cell membrane, respectively 
(17, 29, 30). Although GPBP does not contain the structural requirements to be exported, 
the 5'-end untranslated region df its mRNA includes an upstream ORF of 130 residues 
with an in- frame stop codon at the beginning (Fig. 1). A mRNA editing process inserting a 
single base pair (U) would generate an operative in-frame start site and an ORF of 754- 
residues containing an export signal immediately downstream of the edited Met (not 
shown). Polyclonal antibodies against a synthetic peptide representing part of this 
hypothetical extra-sequence (PRSARCQARRRRGGRTSS (SEQ ID NO:33)) display a 
linear vascular reactivity in human tissues suggestive of an extracellular basement 
membrane localization (data not shown). 

Alternatively, a splicing phenomenon could generate transcripts with additional 
unidentified exon(s) that would provide the structural requirements for exportation. The 
multiple cellular localization, the high content in PTyr, and the lack of tyrosine kinase 
activity in vitro, suggest that GPBP is itself the target of specific tyrosine kinase(s) and 
therefore likely involved in specific signaling cascade(s). 

As discussed above, specific serine phosphorylation, as well as pre-mRNA 
alternative splicing, are associated with the biology of several autoantigens, including the 
GP antigen, acetylcholine receptor and myelin basic protein (MBP) (4). The latter is 
suspected to be the major antigen in multiple sclerosis (MS), another exclusively human 
autoimmune disease in which the immune system targets the white matter of the central 
nervous system. GP disease and MS are human disorders that display a strong association 
with the same HLA class II haplotype (HLA DRB1*1501)(32, 33). This, along with the 
recent report of death by GP disease of a MS patient carrying this HLA specificity (34), 
supports the existence of common pathogenic events in these human disorders. 

Phosphorylation of specific serines has been shown to change intracellular 
proteolysis (35-40). Conceivably, alterations in protein phosphorylation can affect 
processing and peptide presentation, and thus mediate autoimmunity. GP antigen-derived 
peptide presentation by the HLA-DR15 depends more on processing than on preferences 
of relatively indiscriminate DR15 molecules (41), suggesting that if processing is 
influenced by abnormal phosphorylation, the resulting peptides would likely be presented 
by this HLA. Our more recent data indicate that in both the GP and MBP systems, the 
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production of alternative splicing products serves to regulate the phosphorylation of 
specific and structurally homologous PKA sites, suggesting that this or a closely related 
kinase is the in vivo phosphorylating enzyme. Alterations in the degree of antigen 
phosphorylation, caused either by an imbalance in alternative products, or by the action of 
an intruding kinase that deregulates phosphorylation of the same motifs, could lead to an 
autoimmune response in predisposed individuals. rGPBP phosphorylates the human GP 
antigen at a major PKA phosphorylation site in an apparently unregulated fashion, since 
the presence of specific alternative products of the GP antigen did not affect 
phosphorylation of the primary antigen by GPBP (not shown). 

Although GPBP is ubiquitously expressed, in certain organs and tissues it shows a 
preference for cells and tissue structures that are target of common autoimmune responses: 
the Langerhans cells (type I diabetes); the white matter of the central nervous system 
(multiple sclerosis); the biliary ducts (primary biliary cinrhosis); the cortical cells of the 
adrenal gland (Addison disease); striated muscle cells (myasthenia gravis); 
spermatogonium (male infertility); Purkinje cells of the cerebellum (paraneoplasic 
cerebellar degeneration syndrome); and intestinal epithelial cells (pernicious anemia, 
autoimmune gastritis and enteritis). All the above observations point to this novel kinase 
as an attractive candidate to be considered when envisioning a model for human 
autoimmune disease. 
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Example 2: GPBP Alternative Splicing 

Here we report the existence of two isoforms of GPBP that are generated by 
alternative splicing of a 78-base pair (bp) long exon that encodes a 26-residue serine- 
rich motif. Both isoforms, GPBP and GPBPA26, exist as high molecular aggregates 
5 that result from polypeptide self-aggregation. The presence of the 26-residue peptide in 
the polypeptide chain results in a molecular species that self-interacts more efficiently 
and forms aggregates with higher specific activity. Finally, we present evidences 
supporting the observation that GPBP is implicated in human autoimmune 
pathogenesis. 

10 MATERIAL AND METHODS. 
Synthetic polymers: 

Peptides. GPpep 1 , KGKRGDSGSP ATWTTRGFVFT (SEQ ID NO:26), is described in 
Example 1. GPBPpepl, PYSRSSSMSSIDLVSASDDVHRFSSQ (SEQ ID NO:14), _ 
representing residues 371-396 of GPBP was synthesized by Genosys. 

15 Oligonucleotides. The following oligonucleotides were synthesized by Life 
Technologies, Inc., 5' to 3': ON-GPBP-1 lm, G CGG GAC TCA GCG GCC GGA TTT 
TCT (SEQ ID NO:34); ON-GPBP-1 5m. AC AGC TGG CAG AAG AGA C (SEQ ID 
NO:35i. ON-GPBP-20c, C ATG GGT AGC TTT TAA AG (SEQ ID NO; 36); ON- 
GPBP-22m, TA GAA GAA CAG TCA CAG AGT GAA AAG G (SEQ ID NO;37); 

20 ON-GPBP-53c, GAATTC GAA CAA AAT AGG CTT TC (SEQ ID NO:38); ON- 
GPBPoom, CCC TAT AGT CGC TCT TC (SEQ ID NO:39); ON-GPBP-57c, CTG 
GGA GCT GAA TCT GT (SEQ ID NO:40); ON-GPBP-62c, GTG GTT CTG CAC 
CAT CTC TTC AAC (SEQ ID NO:41) ; ON-GPBP-A26, CA CAT AGA TTT GTC 
CAA AAC. GTT GAA GAG ATG GTG CAG AAC (SEQ ID NO:42). 

25 Reverse transcriptase and polymerase chain rection (RT-PCR). Total RNA was 
prepared from different control and GP tissues as described in (15). Five micrograms of 
total RNA was retrotranscribed using Ready-To-Go You-Prime First-Strand beads 
(Amersham Pharmacia Biotech) and 40 pmol of ON-GPBP-53c. The corresponding 
cDNA was subjected to PCR using the pairs of primers ON-GPBP-1 lm/ON-GPBP-53c 

3() or ON-GPBP-1 5m/ON-GPBP-62c. The identity of the products obtained with 15m-62c 
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was further confirmed by Alu I restriction. To specifically amplify GPBP transcripts, 
PCR was performed using primers ON-GPBP-15m/ON-GPBP-57c. 
Northern hybridization studies. Pre-made human multiple-tissue and tumor cell-line 
Northern Blots (CLONTECH) were probed with a cDNA containing the 78-bp exon 
5 present only in GPBP or with a cDNA representing both isoforms. The corresponding 
cDNAs were obtained by PCR using the pair of primers ON-GPBP-56m and ON- 
GPBP-57c using GPBP as a template, or with primers ON-GPBP-22m and ON-GPBP- 
20c, using GPBPA26 as a template. The resulting products were random-labeled and 
hybridized following the manufacturers' instructions. 
10 Plasmid construction, expression and purification of recombinant proteins. The 
plasmid pHIL-FLAG-n4\ used for recombinant expression of FLAG-tagged GPBP in 
Pichia pastoris has been described elsewhere (4). The sequence coding for the 78-bp 
exon was deleted by site-directed mutagenesis using ON-GPBP-A26 to generate the 
plasmid pHIL-FLAG-n4'A26. Expression and affinity-purification of recombinant 
1 5 GPBP and GPBPA26 was done as in (4). 

Gel-filtration HPLC. Samples of 250 were injected into a gel filtration PE-TSK- 
G4000SW HPLC column equilibrated with 50 mM Tris-HCl pH 7.5, 150 mM NaCl. 
The material was eluted from the column at 0.5 ml/min, monitored at 220 nm and 
minute fractions collected. 
20 In vitro phosphorylation assays. The auto-, trans-phosphorylation and in-blot 
renaturation studies were performed as in Example 1 . 

Antibodies and immunochemical techniques. Polyclonal antibodies were raised by in 
chicken against a synthetic peptide (GPBPpepl) representing the sequence coded by 
the 78-bp exon (Genosys). Egg yolks were diluted 1:10 in water, the pH adjusted to 5.0. 

25 After 6 hours at 4*0, the solution was clarified by centrifugation (25 min at 10000 x g at 
4°C) and the antibodies precipitated by adding 20 % (w/v) of sodium sulfate at 20.000 x 
g, 20'. The pellets were dissolved in PBS (1 ml per yolk) and used for 
immunohistochemical studies. The production of antibodies against GPBP/ GPBP A2 6 
or against a3(IV)NCl domain are discussed above (see also 4, 13). 

30 Sedimentation velocity. Determination of sedimentation velocities were performed in 
an Optima XL-A analytical ultracentrifuge (Beckman Instruments Inc.), equipped with 
a VIS-UV scanner, using a Ti60 rotor and double sector cells of Epon-charcoal of 12 
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mm optical path-length. Samples of ca. 400 jal were centrifuged at 30,000 rpm at 20°C 
and radial scans at 220 nm were taken every 5 min. The sedimentation coefficients 
were obtained from the rate of movement of the solute boundary using the program 
XLAVEL (supplied by Beckman). 
5 Sedimentation equilibrium. Sedimentation equilibrium experiments were done as 
described above for velocity experiments with samples of 70 and centrifuged at 
8,000 rpm. The experimental concentration gradients at equilibrium were analyzed 
using the program EQASSOC (Beckman) to determine the corresponding weight 
average molecular mass. A partial specific volumes of 0.71 1 cm 3 /g for GPBP and 0.729 

10 cm'Vg for GPBPA26 were calculated from the corresponding amino acid compositions. 
Physical methods and immunochemical techniques. SDS-PAGE and Western 
blotting were performed under reducing conditions as previously described (3). 
Immunohistochemistry studies were done on formalin fixed paraffin embedded tissues • 
using the ABC peroxidase method (4) or on frozen human biopsies fixed with cold 

15 acetone using standard procedures for indirect immunofluorescence. 

Two hybrid studies. Self-interaction studies were carried out in Saccharomyces 
cercvisiac (HF7c) using pGBT9 and pGAD424 (CLONTECH) to generate GAL4 
binding and activation domain-fusion proteins, respectively. Interaction was assessed 
following the manufacture's recommendations. P-galactosidase activity was assayed 

20 with X-GAL (0.75 mg/ml) for in situ and with ortho-nitrophenyl P-D galactopyranoside 
(0.64 mg/ml) for the in-solution determinations. 

RESULTS 

Identification of two spliced GPBP variants. To characterize the GPBP 
25 species in normal human tissues, we coupled reverse transcription to a polymerase 
chain reaction (RT-PCR) on total RNA from different tissues, using specific 
oligonucleotides that flank the full open reading frame of GPBP. A single cDNA 
fragment displaying lower size than expected was obtained from skeletal muscle- 
derived RNA-(Fig.8A), and from kidney, lung, skin, or adrenal gland-derived RNA (not 
30 shown). By combining nested PCR re-amplifications and endonuclease restriction 
mapping, we determined that all the RT-PCR products corresponded to the same 
molecular species (not shown). We fully sequenced the 2.2-Kb of cDNA from human 
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muscle and found it identical to HeLa-derived material except for the absence of 78- 
nucleotides (positions 1519-1596), which encode a 26-residues motif (amino acids 371- 
396) (Fig. 8B). We therefore named this more common isoform of GPBP as GPBPA26. 

To investigate whether the 78-bp represent an exon skipped transcript during 
5 pre-mRNA processing, we used this cDNA fragment to probe a human-derived 
genomic library and we isolated a -14-Kb clone. By combining Southern blot 
hybridization and PCR, the genomic clone was characterized and a contiguous DNA 
fragment of 12482-bp was fully sequenced (SEQ ED 25). The sequence contained (from 
5* to 3*), 767-bp of intron sequence, a 93-bp exon, an 818-bp intron, the 78-bp exon 
10 sequence of interest, a 9650-bp intron, a 96-bp exon and a 980-bp intron sequence (Fig. 
8C). The exon-intron boundaries determined by comparing the corresponding DNA and 
cDNA sequences meet the canonical consensus for 5' and 3' splice sites (Fig 8C) (5), 
thus confirming the exon nature of the 78-bp sequence. The GPBP gene was localized 
to chromosome 5ql3 by fluorescence in situ hybridization (FISH) using the genomic 
15 clone as a probe (not shown). 

The relative expression of GPBP in human-derived specimens was assessed by 
Northern blot analysis, using either the 78-bp exon or a 260-bp cDNA representing the 
flanking sequence of 78-bp (103-bp 5* and 157-bp 3') present in both GPBP and 
GPBPA20 (Fig. 9). The 78-bp containing the molecular species of interest were 
20 preferably expressed in striated muscle (both skeletal and heart) and brain, and poorly 
expressed in placenta, lung and liver. In contrast to GPBPA26, the GPBP was expressed 
at very low levels in kidney, pancreas and cancer cell lines. 

All the above indicates that GPBP is expressed at low levels in normal human 
tissues, and that the initial lack of detection by RT-PCR of GPBP can be attributed to a 

25 preferential amplification of the more abundant GPBPA26. Indeed, the cDNA of GPBP 
could be amplified from human tissues (skeletal muscle, lung, kidney, skin and adrenal 
gland) when the specific RT-PCR amplifications were done using 78-bp exon-specific 
oligonucleotides (not shown). This also suggests that GPBPA26 mRNA is the major 
transcript detected in Northern blot studies when using the cDNA probe representing 

30 both GPBP and GPBPA26. 
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Recombinant expression and functional characterization of GPBPA26. To 

investigate whether the absence of the 26-residue serine-rich motif would affect the 
biochemical properties of GPBP, we expressed and purified both isoforms (rGPBP and 
rGPBPA26), and assessed their auto- and trans-phosphorylation activities (Fig. 10). As 
reported above for rGPBP (see also 4), rGPBPA26 is purified as a single major 
polypeptide and several related minor products (Fig. 10 A). However, the number and 
relative amounts of the derived products vary compared to rGPBP, and they display M r 
on SDS-PAGE that cannot be attributed simply to the 26-residue deletion. This 
suggests that the 26-residue motif has important structural and functional consequences 
that could account for the reduced in-solution auto- and trans-phosphorylation activities 
displayed by rGPBPA26 (Fig.lOB). Interestingly, the differences in specific activity 
shown in the in-solution assays were not evident when autophosphorylation was 
assessed in-blot after SDS-PAGE and renaturation, suggesting that the 26-residue motif 
likely has important functional consequences at the quaternary structure level. 
Renaturation studies further showed that phosphate transfer activities reside in the 
major polypeptides representing the proposed open reading frames, and are not 
delectable in derived minor products. 

rGPBP and rGPBP-26 exist as very active high molecular weight 
aggregates- Gel filtration analysis of affinity-purified rGPBP or rGPBP A26 yielded 
two chromatographic peaks (I and II), both displaying higher MW than expected for the 
individual molecular species, as determined by SDS-PAGE studies (89 kDa and 84 
kDa. respectively ) (Fig. 11). The bulk of the recombinant material eluted as a single 
peak between the 1 58 kDa and the 669 kDa molecular weight markers (peak II), while 
limited amounts of rGPBP and only traces of rGPBPA26 eluted in peak I (>1000 kDa). 
Aliquots of fractions representing each chromatographic profile were subjected to SDS- 
PAGE and stained, or incubated in the presence of 32 P[y] ATP, and analyzed by 
immunoblot and autoradiography. Along with the major primary polypeptide, every 
chromatographic peak contained multiple derived products of higher or lower sizes 
indicating that the primary polypeptide associates to form high molecular weight 
aggregates that are stabilized by covalent and non-covalent bonds (not shown). The 
kinase activity also exhibited two peaks coinciding with the chromatographic profiles. 
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However, peak I showed a much higher specific activity than peak II, indicating that 
these high molecular weight aggregates contained a much more active form of the 
kinase. Equal volumes of rGPBP fractions number 13 and 20 exhibited comparable 
phosphorylating activity, even though the protein content is approximately 20 times 
lower in fraction 13, as estimated by Western blot and Coomasie blue staining (Fig. 
1 1 A): The specific activities of rGPBP and rGPBP A26 at peak II are also different, and 
are consistent with the studies shown for the whole material, thus supporting the 
hypothesis that the presence of the 26-rediue serine-rich motif renders a more active 
kinase. These results also suggest that both rGPBP and rGPBP A26 exist as oligomers 
under native conditions, and that both high molecular weight aggregate formation and 
specific activity are greatly dependent on the presence of the 26-residue serine-rich 
motif Analytical centrifugation analysis of rGPBP revealed that peak I contained large 
aggregates (over 10 7 Da). Peak II of rGPBP contained a homogenous population of 220 
± 10 kDa aggregates, likely representing trimers with a sedimentation coefficient of 
1 IS. Peak II of rGPBP A26 however consisted of a more heterogenous population that 
likely contains several oligomeric species. The main population (ca. 80%) displayed a 
weight average molecular mass of 310 ± 10 kDa and a coefficient of sedimentation of 
14S. 

GPBP and GPBPA26 self-interact in a yeast two-hybrid system. To assess 
the physiological relevance of the self-aggregation, and to determine the role of the 26- 
residue motif we performed comparative studies using a two-hybrid interaction system 
in yeast. In this type of study, the polypeptides whose interaction is under study are 
expressed as a part of a fusion protein containing either the activation or the binding 
domains of the transcriptional factor GAL4. An effective interaction between the two 
fusion proteins through the polypeptide under study would result in the reconstitution 
of the transcriptional activator and the subsequent expression of the two reporter genes, 
Lac Z and His3, allowing colony color detection and growth in a His-defective 
medium, respectively. We estimated the intensity of interactions by the growth-rate in 
histidine-defective medium, in the presence of different concentrations of a competitive 
inhibitor of the His3 gene product (3-AT), and a quantitative colorimetric liquid (3- 
galactosidase assay. A representative experiment is presented in Fig. 12. When 
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assaying GPBPA26 for self-interaction, a significant induction of the reporter genes 
was observed, while no expression was detectable when each fusion protein was 
expressed alone or with control fusion proteins. The insertion of the 26-residue motif in 
the polypeptide to obtain GPBP resulted in a notable increase in polypeptide 
5 interaction. All of the above data indicate that GPBPA26 self-associates in vivo, and 
that the insertion of the 26-residues into the polypeptide chain yields a more interactive 
molecular species. 

GPBP is highly expressed in human but not in bovine and murine 
glomerulus and alveolus. We have shown that GPBP/GPBPA26 is preferentially 

10 expressed in human cells and tissues that are commonly targeted in naturally occurring 
autoimmune responses. To specifically investigate the expression of GPBP, we raised 
polyclonal antibodies against a synthetic peptide' representing the 26-residue motif 
characteristic of this kinase isoform, and used it for immunohistochemical studies on 
frozen or formalin fixed paraffin embedded human tissues (Fig 13). In general, these " 

15 antibodies showed more specificity than the antibodies recognizing both isoforms for 
the tissue structures that are target of autoimmune responses such as the biliary ducts, 
the Langerhans islets or the white matter of the central nervous system (not shown). 
Nevertheless, the most remarkable finding was the presence of linear deposits of 
GPBP-selective antibodies around the small vessels in every tissue studied (A), 

20 suggesting that GPBP is associated with endothelial basement membranes. 
Consequently, at the glomerulus, the anti-GPBP antibodies displayed a vascular pattern 
closely resembling the glomerular basement membrane staining yielded either by 
monoclonal antibodies specifically recognizing the a3(IV)NCl (compare 13B with 
13C and 13D), or by circulating GP autoantibodies (compare 13E and 13F). These 

25 observations further supported the initial observation that GPBP is expressed in tissue 
structures targeted in natural autoimmune responses, suggesting that the expression of 
GPBP is a risk factor and makes the host tissue vulnerable to an autoimmune attack. 

To further assess this hypothesis, we investigated the presence of GPBP and 
GPBPA26 in the glomerulus of two mammals that naturally do not undergo GP disease 
30 compared to human (Fig. 14). GPBP-specific antibodies failed to stain the glomerulus 
of both bovine or murine specimens (compare 14A with 14B and 14C) while antibodies 
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recognizing the N-terminal sequence common to both GPBP and GPBPA26 stained 
these structures in all three species, although with different distributions and intensities 
(14D-14F). In bovine renal cortex, GPBPA26 was-expressed at a lower rate than in 
human, but showed similar tissue distribution. In murine samples, however, GPBPA26 

5 displayed a tissue distribution closely resembling that of GPBP in human glomerulus. 
Similar results were obtained when studying the alveolus in the three different species 
(not shown). To rule out that the differences in antibody detection was due to primary 
structure differences rather than to a differential expression, we determined the 
corresponding primary structures in these two species by cDNA sequencing. Bovine 

10 and mouse GPBP (SEQ ID NOS:3-6 and 9-12) displayed an overall identity with 
human material of 97.9% and 96.6% respectively. Furthermore, the mouse 26-residue 
motif was identical to human while bovine diverged only in one residue. Finally, and 
similarly to human, wc successfully amplified GPBP cDNA from mouse or bovine 
kidney total RNA using oligonucleotides specific for the corresponding 78-bp exons, 

15 indicating that GPBP is expressed at very low levels not detectable by immunochemical 
techniques. 

GPBP is highly expressed in several autoimmune conditions. We analyzed 
several tissues from different GP patients by specific RT-PCR to assess 
GPBP/GPBPA26 mRNA levels. As in control kidneys, the major expressed isoform in 

20 GP kidneys was GPBPA26. However, in the muscle of one of the patients, GPBP was 
preferentially expressed, whereas GPBPA26 was the only isoform detected in control 
muscle samples (Fig. 15 A). Since we did not have kidney samples from this particular 
patient, we could not assess GPBP/GPBPA26 expression in the corresponding target 
organ. For similar reasons, we could not assess GPBP/GPBPA26 levels in the muscle of 

25 the patients in which kidneys were studied. Muscle cells express high levels of 
GPBP/GPBPA26 (see Northern blot in Fig. 9), and they comprise the bulk of the tissue. 
In contrast, the expression of GPBP/GPBPA26 in the kidney was much less, and the 
glomerulus was virtually the only kidney structure expressing the GPBP isoform (see 
Fig. 13). The glomerulus is a relatively less abundant structure in kidney than the 

30 myocyte is in muscle, and the glomerulus is the structure targeted by immune attack in 
GP pathogenesis. These factors, together with the preferential amplification of the more 
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abundant and shorter messages when performing RT-PCR studies, could account for 
the lack of detection of GPBP in both normal and GP kidneys, thus precluding the 
assessment of GPBP expression at the glomerulus during pathogenesis. Nevertheless, 
the increased levels of GPBP in a GP patient suggest that GPBP/GPBPA26 expression 
5 is altered during GP pathogenesis, and that augmented GPBP expression has a 
pathogenic significance in GP disease. 

To investigate the expression of GPBP and GPBPA26 in autoimnune 
pathogenesis, we studied cutaneous autoimmune processes and compared them with 
control samples representing normal skin or non-autoimmune dermatitis (Fig. 15). 

10 Control samples displayed a limited expression of GPBP in the most peripheral 
keratinocytes (15B, 15E), while keratinocytes expanding from stratum basale to 
corneum expressed abundant GPBP in skin affected by systemic lupus erythematosus 
(SLE) (15C, 15F) or lichen planus (15D, 15G). GPBP was preferentially expressed in 
cell surface structures that closely resembled the blebs previously described in cultured 

15 keratinocytes upon UV irradiation and apoptosis induction (6). In contrast, antibodies 
recognizing both GPBP and GPBPA26 yielded a diffuse cytosolic pattern through the 
whole epidermis in both autoimmune affected or control samples (not shown). These 
data indicate that in both control and autoimmune-affected keratinocytes, GPBPA26 
was expressed at the cytosol and that the expression did not significantly vary during 

20 cell differentiation. In contrast, mature keratinocytes were virtually the only GPBP 
expressing cells. However, bleb formation and expression of GPBP was observed in the 
early stages of differentiation in epidermis affected by autoimmune responses (15C, 
15D, 15F, 15G). This further supports previous observations indicating that aberrant 
apoptosis at the basal keratinocytes is involved in the pathogenesis of autoimmune 

25 processes affecting skin (7), and suggests that apoptosis and GPBP expression are 
linked in this human cell system. . 

DISCUSSION 

Alternative pre-mRNA splicing is a fundamental mechanism for differential 
gene expression that has been reported to regulate the tissue distribution, intracellular 
30 localization, and function of different protein kinases (8-1 1). In this regard, and closely 
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resembling GPBP, B-Raf exists as multiple spliced variants, in which the presence of 
specific exons renders more interactive, efficient and oncogenic kinases (12). 

Although it is evident that rGPBPA26 still bears the uncharacterized catalytic 
domain of this novel kinase, both auto- and trans-phosphorylating activities are greatly 
reduced when compared to rGPBP. Gel filtration and two hybrid experiments provide 
some insights into the mechanisms that underlie such a reduced phosphate transfer 
activity. About 1-2% of rGPBP is organized in very high molecular weight aggregates 
that display about one third of the phosphorylating activity of rGPBP, indicating that 
high molecular aggregation renders more efficient quaternary structures. Recombinant 
GPBPA26, with virtually no peak I material, consistently displayed a reduced kinase 
activity. However, aggregation does not seem to be the only mechanism by which the 
26-residues increases specific activity, since the rGPBP A26 material present in peak II 
also shows a reduced phosphorylating activity when compared to homologous fractions 
of rGPBP. One possibility is that rGPBP-derived aggregates display higher specific 
activities because of quaternary structure strengthening caused by the insertion of the 
26-residue motif. The oligomers are kept together mainly by very strong non-covalent 
bonds, since the bulk of the material appears as a single polypeptide in non-reducing 
SDS-PAGE, and the presence of either 8 M urea or 6 M guanidine had little effect on 
chromatographic gel filtration profiles (not shown). How the 26-residue motif renders a 
more strengthened and active structure remains to be clarified. Conformational changes 
induced by the presence of an exon encoded motif that alter the activation status of the 
kinase have been proposed for the linker domain of the Src protein (24) and exons 8b 
and 10 of B-Raf (12). Alternatively, the 26-residue motif may provide the structural 
requirements such as residues whose phosphorylation may be necessary for full 
activation of GPBP. 

We have reported (13) that the primary structure of the GP antigen (a3(TV)NCl) is 
the target of a complex folding process yielding multiple conformers. Isolated conformers 
are non-minimum energy structures specifically activated by phosphorylation for 
supramolecular aggregation and likely quaternary structure formation. In GP patients, the 
a3(IV)NCl shows conformational alterations and a reduced ability to mediate the 
disulfide stabilization of the collagen IV network. The GP antibodies, in turn, demonstrate 
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stronger affinity towards the patient <x3(IV)NCl conformers, indicating that 
conformational^ altered material caused the autoimmune response. Therefore, it seems 
that in GP disease an early alteration in the conforming process of the cc3(IV)NCl could 
generate altered conformers for which the immune system is not tolerant, thus mediating 
5 the autoimmune response. 

Other evidence (Raya et al., unpublished results) indicates that phosphorylation is 
the signal that drives the folding of the a3(IV)NCl into non-minimum energy ends. In this 
scenario, three features of the human a3(IV)NCl system are of special pathogenic 
relevance when compared to the corresponding antigen systems from species that, like 

10 bovine or murine, do not undergo spontaneous GP disease. First, the N-terminus of the 
human a3(IV)NCl contains a motif that is phosphorylatable by PKA and also by GPBP 
(see above, and also 2-4). Second, the human gene generates multiples alternative 
products by alternative ex on splicing (14,15). Exon skipping generates alternative 
products with divergent C-terminal ends that up-regulate the in vitro PKA 

15 phosphorylation of the primary cc3(IV)NCl product (See below Example 3). Third, the 
human GPBP is expressed associated with glomerular and alveolar basement membranes, 
the two main targets in GP disease. The phosphorylation-dependent conforming process is 
also a feature of non-pathogenic NCI domains (13), suggesting that the phosphorylatable 
N-terminus, the alternative splicing diversification, and the expression of GPBP at the 

20 glomerular and alveolar basement membranes, are all exclusively human features that 
place the conformation process of ct3(IV)NCl in a vulnerable condition. The four 
independent GP kidneys studied expressed higher levels of GP antigen alternative 
products (15; Bemal and Saus, unpublished results), and an augmented expression of 
GPBP were found in a GP patient (see above). Both increased levels of alternative GP 

25 antigen products and GPBP arc expected to have consequences in the phosphorylation- 
dependent confoimational process of the a3(IV)NCl, and therefore with pathogenic 
potential. 

GPBP is highly expressed in skin targeted by natural autoimmune responses. In 
the epidermis, GPBP is associated with cell surface blebs characteristic of the 
30 apoptosis-mediated differentiation process that keratinocytes undergo during 
maturation from basale to corneum strata (22, 23). Keratinocytes from SLE patients 
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show a remarkably heightened sensitivity to UV-induced apoptosis (6, 18, 20), and 
augmented and premature apoptosis of keratinocytes has been reported to exist in SLE 
and dermatomyositis (7). Consistently, we found apoptotic bodies expanding from 
basal to peripheral strata of the epidermis in several skin autoimmune conditions 

5 including discoid lupus (not shown), SLE and lichen planus. Autoantigens, and 
modified versions thereof are clustered in the cell surface blebs of apoptotic 
keratinocytes (6,18,20). Apoptotic surface blebs present autoantigens (21), and likely 
release modified versions to the circulation (16-20). It has been suggested that the 
release of modified autoantigens from apoptotic bodies could be the immunizing event 

10 that mediates systemic autoimmune responses mediating SLE and scleroderma (18,19). 

Our evidence indicates that both GPBP and GPBPA26 are able to act in vitro as 
protein kinases, with GPBP being a more active isoform than GPBPA26. Furthermore, 
recombinant material representing GPBP or GPBPA26 purified from yeast or from 
human 293 cells contained an associated proteolytic activity that specifically degrades 

15 the a3(TV)NCl domain (unpublished results). The proteolytic activity operates on 
a3(IV)NCl produced in an eukaryotic expression system, but not on recombinant 
material produced in bacteria (unpublished results), indicating that a3(IV)NCl 
processing has some conformational or post-translational requirements not present in 
prokaryotic recombinant material. Finally, it has been reported that several autoantigens 

20 undergo phosphorylation and degradation in apoptotic keratinocytes (20). While not 
being limited to an exact mechanism, we propose, in light of all of the above data, that 
the machinery assembling GPBP at the apoptotic blebs likely performs a complex 
modification of the autoantigens that includes phosphorylation, conformational changes 
and degradation. Accordingly, recombinant protein representing autoantigens in SLE 

25 (PI ribosomal phosphoprotein and Srn-Dl small nuclear ribonucleoproteins) and in 
dermatomyositis (hystidil-tRNA synthetase) were in vitro substrates of GPBP 
(unpublished results). 

The down-regulation in cancer cell lines of GPBP, suggest that the cell 
machinery harboring GPBP/GPBPA26 is likely involved in signaling pathways 
30 inducing programmed cell death. The corresponding apoptotic pathway could be up 
regulated during autoimmune pathogenesis to cause an altered antigen presentation in 
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individuals carrying specific MHC haplotypes; and down regulated during cell 
transformation to prevent autoimmune attack to the transformed cells during tumor 
growth. 
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Example 3. Regulation of Human Autoantigen Phosphorylation by Exon Splicing 
INTRODUCTION 

In GP disease, the immune system attack is mediated by autoantibodies against the 
non-collagenous C-terminal domain (NCI) of the a3 chain of collagen IV (the GP 
antigen) (1). The N-terminus of the human a3(IV)NCl contains a highly divergent and 
hydrophilic region with a unique structural motif, KRGDS 9 , that harbors a cell adhesion 
signal as an integral part of a functional phosphorylation site for type A protein kinases 
(2,3). Furthermore, the gene region encoding the human GP antigen characteristically 
generates multiple mRNAs by alternative exon splicing (4,5). The alternative products 
diverge in the C-terminal ends and all but one share the N-terminal KRGDS 9 (4,5). 

Multiple sclerosis (MS) is an exclusive human neurological disease characterized 
by the presence of inflamatory demyelization plaques at the central nervous system. (6). 
Several evidences indicate that this disease is caused by an autoimmune attack mediated 
by cytotoxic T cells towards specific components of the white matter including the myelin 
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basic protein (MBP) (7, 8). In humans, the MBP gene generates four products (MBP, 
MBP ATI, MBPAV and MBPAII/V) that result from alternative exon splicing during pre- 
mRNA processing (9). Among these, MBPAII is the more abundant form in the mature 
central nervous system, while MBP form containing all the exons is virtually absent (9). 
5 Several biological similarities exist between the autoirnme responses mediating 

GP disease and MS, namely: 1) both are human exclusive diseases and typically initiate 
after a viral flu-like disease; 2) a strong linkage exists to the same haplotype of the HLA- 
DR region of the class II MHC; 3) several products are generated by alternative splicing; 
and 4) the death of a MS patient by GP disease has recently been reported (10). 

10 

MATERIALS AND METHODS 

Synthetic polymers: GPAIII derived peptide, QRAHGQDLDALFVKVLRSP (SEQ 
ID NO:43) and GPAIII/IV/V derived peptide, QRAHGQDLESLFHQL (SEQ ID^ 
NO:44) were synthesized using either Boc- (MedProbe) or Fmoc- (Chiron, Lipotec) 
15 chemistry. 

Plasmid construction and recombinant expression. 

GP derived material: The constructs representing the different GP-spliced 
forms were obtained by subcloning the cDNAs used elsewhere to express the 

20 corresponding recombinant proteins (5) into the BamHI site of a modified pET15b 
vector, in which the extraneous vector-derived amino-terminal sequence except for the 
initiation Met was eliminated. The extra sequence was removed by cutting the vector 
with Ncol and Bam HI, filling-in of the free ends with Klenow, and re-ligation. This 
resulted in the reformation of both restriction sites and placed the BamHI site 

25 immediately downstream of the codon for the amino-terminal Met. 

The recombinant proteins representing GP or GPAV (SEQ ID NO:46) were 
purified by precipitation (5). Bacterial pellets containing the recombinant proteins 
representing GPAIII (SEQ ID NO:48) or GPAIII/IV/V (SEQ ID NO:50) were dissolved 
by 8 M urea in 40 mM Tris-HCl pH 6.8 and sonication. After centrifugation at 40,000 x 

30 g the supematants were passed through a 0.22 jam filter and applied to resource Q 
column for FPLC. The effluent was acidified to pH 6 with HC1 and applied to a 
resource S column previously equilibrated with 40 mM MES pH 6 for a second FPLC 
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purification. The material in the resulting effluent was used for in vitro 
phosphorylation. 

MBP-derived material: cDNA representing human MBPAII (SEQ ID NO:51) 
was obtained by RT-PCR using total RNA from central nervous system. The cDNA 
representing human MBP was a generous gift from C. Campagnoni (UCLA). Both 
fragments were cloned into a modified version of pHIL-D2 (Invitrogen) containing a 
6xHis-coding sequence at the C-terminus to generate pHIL-MBPAII-His and pHIL- 
MBP-His, respectively. These plasmids were used for recombinant expression in Pichia 
pastons as described in (12). Recombinant proteins were purified using immobilized 
metal affinity chromatography (TALON resin, CLONTECH) under denaturant 
conditions (SM urea) and eluted with 300 mM imidazole following manufacturers' 
instructions. The affinity-purified material was then renatured by dilution into 80 
volumes of 50 mM Tris-HCl pH 8.0, 1 0 mM CHAPS, 400 mM NaCl, 2 mM DTT, and 
concentrated 50 times by ultrafiltration through a YMlO-type membrane (AMICON). 
The Ser to Ala mutants were produced by site-directed mutagenesis over native 
sequence-containing constructs using transformer mutagenesis kit from CLONTECH 
and the resulting proteins were similarly produced. 

Phosphorylation studies. Phosphorylation studies were essentially done as 
described above (see also 3 and 12). In some experiments, the substrates were in-blot 
renatured and then, phosphorylated for 30 min at room temperature by overlaying 100 
|il of phosphorylation buffer containing 0.5 |ig of rGPBP. Digestion with V8 
endopeptidasc and immunoprecipitation were performed as described in (3). 

Antibody production. Synthetic peptides representing the C-terminal divergent 
ends of GPAIII or GPAIII/IV/V comprised in SEQ ID NO:43 or SEQ ID NO:44 
respectively were conjugated to a cytochrome C, BSA or ovoalbumine using a 
glutaraldehyde coupling standard procedure. The resulting protein conjugates were 
used for mouse immmunization to obtain polyclonal antibodies specific for GPAIII and 
monoclonal antibodies specific for GPAIII/IV/V (Mabl53). To obtain monoclonal 
antibodies specific for GPAV (MabSA) mouse were immunized using recombinant 
bacterial protein representing the corresponding alternative form comprising the SEQ 
ID NO. 50. The production of monoclonal (M3/1, Pl/2) or polyclonal (anti-GPpepl) 
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antibodies against SEQ ID NO: 26 which represents the N-terminal region of the GP 
alternative forms have been previously described (3,5). 
Boc-based peptide synthesis. 

Assembling .The peptide was assembled by stepwise solid phase synthesis using 
a Boc-Benzyl strategy. The starting resin used was Boc-Pro-PAM resin (0.56 meq/g, 
batch R4108). The deprotection /coupling procedure used was: TFA (lxlmin) TFA (lx 
3 min) DCM (flow flash) Isopropylalcohol (lx 30 sec) DMF (3 x 1 min) 
COUPLING/DMF (1 xlO min) DMF (lxl min) COUPLING/DMF (lx 10 min) DMF 
(2\ lmin) DCM (lx lmin). For each step 10 ml per gram of peptide-resin were used. 
The coupling of all amino acids (fivefold excess) was performed in DMF in the 
presence of BOP, Hobt and DIEA. For the synthesis the following side-chain protecting 
groups were used: benzyl for serine; 2 chlorobenzyloxycarbonyl for lysine; cyclohexyl 
tor aspartic and glutamic acid; tosyl for histidine and arginine. 

Cleavage. The peptide was cleaved from the resin and fully deprotected by a 
treatment with liquid Hydrogen Fluoride (HF): Ten milliliters of HF per gram of 
peptide resin were added and the mixture kept at 0° C for 45 min in the presence of p- 
cresol as scavengers. After evaporation of the HF, the crude reaction mixture is washed 
\\ nh ether, dissolved in TFA, precipitated with ether and dried. 

Purification. Stationary phase: Silica CI 8, 15 |am, 120 A; Mobile phase: solvent 
V water 0. 1% TFA and solvent B: acetonitrile /A, 60/40 (v/v); Gradient: linear from 20 
to (in 0 ,, B in 30 min; Flow rate: 40 ml/min; and detection was U.V (210 nm). Fractions 
with a purity higher than 80% were pooled and lyophilized. Control of purity and 
identit\ was performed by analytical HPLC and ES/MS. The final product had 88% 
punty and an experimental molecular weight of 2192.9. 

Fmoc-based peptide synthesis. 

Assembling. The peptides were synthesized by stepwise linear solid phase on 
Pro-clorolrityl-resin (0.685 meq/g) with standard Fmoc/tBu chemistry. The 
deprotection /coupling procedure used was: Fmoc aa (0.66 g) HOBt (0.26 g) DIPCDI 
(0.28 ml) for 40 min following a control by Kaiser test. If the test was positive the time 
was extended until change to negative. Then DMF (31 min), piperidine/DMF 20% (1 1 
min) piperidine/DMF 20% (15 min) and DMF (41 min). Side chain protectors were: 
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Pmc (pentamethylcromane suifonyl) for arginine, Bcc (tert-butoxycarbonyl) for lysine, 
tBu (tert-butyl) for aspartic acid and for serine and Trl (trityl) for histidine. 

Cleavage, The peptide was cleaved and fully deprotected by treatment cleavage 
with TF A/water 90/10. Ten milliliters of TFA solution per gram of resin were added. 
5 Water acts as scavenger. After two hours, resin was filtered and the resulting solution 
was precipitated five times with cold diethylether. The final precipitated was dried. 

Purification. Stationary phase: Kromasil CI 8 10 (am; Mobile phase: solvent A: 
water 0.1% TFA and solvent B: acetonitrile 0.1% TFA; Isocratic: 28% B; Flow rate: 55 
ml/min; Detection: 220 nm. Fractions with the higher purity were pooled and 
10 lyophilized, and a second HPLC purification round performed. Control of purity and 
identity was performed by analytical HPLC and ES/MS. The final product had 97% 
purity and an experimental molecular weight of 2190.9. 

RESULTS 

15 Regulation of the phosphorylation of the human GP antigen by alternative 

splicing. We produced bacterial recombinant proteins representing the primary antigen 
(GP) or the individual alternative products GPAV (SEQ ID NO:46), GPAIII (SEQ ID 
NO:48) and GPAIII/IV/V (SEQ ID NO:50), and we tested their ability to be 
phosphorylated by PKA (Figure 16, left panel ). Using standard ATP concentrations (150 
20 ^iM), all four recombinant antigens were phosphorylated but to very different extents. The 
alternative forms incorporated 32 P more efficiently than the primary GP antigen, 
suggesting that they are better substrates. Because these antigens are expected to be in the 
extracellular compartment, we also assayed their phosphorylatability with more 
physiological ATP concentrations (0.1-0.5 jaM). Under these conditions, the differences in 

25 32 P incorporation between the primary and alternative products were more evident, 
indicating that at low ATP concentrations the primary GP antigen was a very poor 
substrate for the kinase. Among the three PKA phosphorylation sites present in the GP 
antigen, the N-terminal Ser 9 and Ser 26 are the major ones, and are common to all the 
alternative products assayed (3,5). Accordingly, the differences observed in 

30 phosphorylation for the full polypeptides also existed among the individual N-terminal 
regions, as determined after specific V8 digestion and immunoprecipitation (not shown). 
This strongly suggests that differences in phosphorylation might be due to the presence of 
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different C-terminal sequences in the alternative products. Since GPAIII and GPAIILTWV 
displayed significantly higher 32 P incorporation rates than GPAV, and they have shorter 
divergent C-terminal regions (5), we used synthetic peptides individually representing 
these C-terminal sequences (SEQ ID NO: 43, SEQ ID NO:44) to further examine their 

5 regulatory roles in the in vitro phosphorylation of the native antigen. Collagen IV is a 
trimeric molecule comprised of three interwoven a chains. In basement membranes, two 
collagen IV molecules assemble through their NCI domains to yield a hexameric NCI 
structure that can be solubilized by bacterial collagenase digestion (1). Dissociation of the 
hexamer structure releases the GP antigen in monomelic and disulfide-related dimeric 

10 forms (1). For the following set of experiments, we carried out phosphorylations in the 
presence of low, extracellular-like ATP concentrations using both monomelic or 
hexameric native GP antigen (Figure 16, right panel ). The presence of each specific 
peptide but not control peptides (not shown) induced the phosphorylation of a single 
polypeptide displaying an apparent MW of 22 kDa. By specific V8 digestion and 

15 immunoprecipitation, the corresponding polypeptide has been identified as the 22 kDa 
conformer of the a3(IV)NCl, previously characterized and identified as the best substrate 
for the PKA(ll). 

Regulation of the phosphorylation of the MBP by alternative splicing. The 

MBP contains at its N terminal region two PKA phosphorylation sites (Ser 8 , Ser 57 ) that are 
20 structurally similar to the N terminus site (Ser 9 ) present in GP antigen products (Fig 17). 
The Ser* site present in all the MBP proteins is located in a similar position than the Ser 9 
in the GP-derived polypeptides. In addition, in the MBP and GPAIII Ser 8 and Ser 9 
respectively are at a similar distance in the primary structures of a highly homologous 
motif present in the corresponding exon II (bend arrow in Fig 17). The GPAIII-derived 
25 motif coincides with the C terminal divergent region that up-regulates PKA 
phosphorylation of Ser° in the GP antigen system (Fig. 16). The regulatory-like sequence 
in MBP is located at exon II and its presence in the final products depends on an 
alternative exon splicing mechanism. Therefore, the MBP motif identified by structural 
comparison to GPAIII may be also regulating PKA phosphorylation of Ser 8 . We produced 
30 recombinant proteins representing MBP and MBPAII (SEQ ID NO:54) and the 
corresponding Ser to Ala mutants to knock-out each of the two PKA phosphorylation sites 
(Ser 8 and Ser 57 ) present in exon I. Subsequently, we assessed its in vitro phosphorylation 
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by PKA (Fig. 18). MBPAII was a better substrate than MBP, and Ser 8 was the major 
phosphorylation site, indicating that, similarly to GP antigenic system, alternative exon 
splicing regulates the PKA phosphorylation of specific sites located at the N-terminal 
region common to all the MBP-derived alternative forms. 

5 In similar experiments assessing GPBP phosphorylation of the recombinant MBP 

proteins, GPBP preferentially phosphorylated MBP, while little phosphorylation of 
MBPAII was observed (Fig. 19). Furthermore, recombinant Ser to Ala mutants displayed 
no significant reduction in 32 P incorporation, indicating that GPBP phosphorylates 
MBP/MBPAII in an opposite way than PKA, and that these two kinases do not share 

I « i major phosphorylation sites in MBP proteins. 

From all these data we concluded that in the MBP system, alternative splicing 
regulates the phosphorylation of specific serines by either PKA or GPBP. 

Synthetic peptides representing the C terminal region of GPAIII influence 
GPBP phosphorylation. To assess the effect of the C terminal region of GPAIII on 
GPBP activity, peptides representing this region were synthesized using two different 
chemistries (Boc or Fmoc), and separately added to a phosphorylation mixture 
containing GPBP (Fig. 20). Boc-based synthetic peptides positively influenced GPBP 
autophosphorylation while Fmoc-based inhibited GPBP autophosphorylation, 
suggesting that the regulatory sequences derived from the alternative products in either 
:<> GP and MBP antigenic systems can influence the kinase activity of GPBP. 

DISCUSSION 

We have shown that the a3(IV)NCl domain undergoes a complex structural 
diversification by two different mechanism: 1) alternative splicing (4,5) and 2) 

'25 conformational isomerization of the primary product (11). Both mechanisms generate 
products that are distinguished by PKA, indicating that PKA phosphorylation is a 
critical event in the biology of the a3(IV)NCl domain. Phosphorylation guides at least 
in part the folding, but also the supramolecular assembly of the oc3(IV)NCl domain in 
the collagen IV network (1 1 and Raya et al. unpublished results). Altered conformers of 

30 the a3(IV)NC1 lead the autoimmune response mediating GP disease (11), suggesting 
that an alteration in antigen phosphorylation could be the primary event in the onset of 
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the disease. Accordingly, we have found increased expression levels of GPAIII in 
several GP kidneys (4 and Bernal and Saus, unpublished results), and an increased 
expression of GPBP has been detected in another Goodpasture patient (Fig. 15). Both 
increased expression of alternative GP antigen products and of GPBP are expected to 

5 have consequences in the phosphorylation steady state of a3(IV)NCl, and therefore in 
the corresponding conformational process. The discrimination among the different 
structural products by PKA strongly suggests that this kinase, or another structurally 
similar kinase, is involved in the physiological antigen conforming process, and that 
antigen phosphorylation by GPBP has a pathogenic significance. In pathogenesis, 

10 GPBP could be an intruding kinase, interfering in the phosphorylation-dependent 
conforming process. Accordingly, GPBP is expressed in tissue structures that are 
targeted by natural autoimmune responses, and an increased expression of GPBP is 
associated with several autoimmune conditions (See examples 1 and 2 above). 

An alternative splicing mechanism also regulates the PKA phosphorylation of 

15 specific serines in the MBP antigenic system. MBP is also a substrate for GPBP 
suggesting that GPBP may play a pathogenic role in multiple sclerosis, and other 
autoimmune responses. 

All of the above data identify GPBP as a potential target for therapeutics in 
autoimmune disease. In Fig 20, we show that synthetic peptides representing the C 

20 terminal region of GPAIII (SEQ ID NO:43) modulate the action of GPBP in vitro , and 
therefore we identified this and related sequences as peptide-based compounds to 
modulate the activity of GPBP in vivo . The induction of GP antigen phosphorylation 
by PKA was achieved when using Boc-based peptides, but not when using similar 
Fmoc-based peptides. Furthermore, Boc- but not Fmoc-based peptides were in vitro 

25 substrates of PKA (not shown), indicating that important structural differences exist 
between both products. Since both products displayed no significant differences in 
mass spectrometry, one possibility is that the different deprotection procedure used may 
be responsible for conformational differences in the secondary structure that may be 
critical for biological activity. Accordingly, Boc-based peptide loses its ability to 

30 induce PKA upon long storage at low temperatures. 
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The present invention is not limited by the aforementioned particular preferred 
embodiments. It will occur to those ordinarily skilled in the art that various 
30 modifications may be made to the disclosed preferred embodiments without diverting 
from the concept of the invention. All such modifications are intended to be within the 
scope of the present invention. 
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I claim: 

1. An isolated nucleic acid sequence comprising a sequence substantially similar 
to a nucleic acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ED 

5 NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, 
SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, and 
SEQ ID NO:25. 

2. An isolated nucleic acid sequence comprising a nucleic acid sequence selected 
10 from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5 ? SEQ ID 

NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, 
SEQ ID NO: 1 9, SEQ ID NO:2 1 , SEQ ID NO:23, and SEQ ID NO:25. 

3. An isolated nucleic acid comprising a sequence that encodes a polypeptide . 
15 selected from the group consisting of GPBP, GPBPA26, and GPBPpepl, or fragments 

thereof. 

A. An isolated nucleic acid sequence comprising a sequence that encodes a protein 
sequence substantially similar to a protein sequence selected from the group consisting 
20 of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID 
NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID 
NO:22, and SEQ ID NO:24 

5. An isolated nucleic acid sequence comprising a sequence that encodes a protein * 
25 sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 

NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID N0.12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ED NO:22, and SEQ ID NO:24. 

6. A recombinant expression vector comprising the isolated nucleic acid sequence 
30 of any one of claims 1-5. 
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7. A recombinant expression vector comprising an isolated nucleic acid sequence 
comprising a nucleic acid sequence selected from the group consisting of SEQ ID 
NO: 1 , SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 1 1 , SEQ 
ID NO: 13, SEQ ID NO:15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID 
NO:23, and SEQ ID NO:25, or fragments thereof 

8. A host cell transfected with the recombinant expression vector of claim 6 or 7. 

9. A substantially purified polypeptide, comprising an amino acid sequence 
substantially similar to a sequence selected from the group consisting of SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ED 
NO:14, SEQ ID NO:16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:24, or peptide fragments thereof 

10. A substantially purified polypeptide, comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID 
NO: 1 8, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, or peptide fragments thereof. 

11. A substantially purified protein comprising a polypeptide selected from the 
group consisting of GPBP, GPBPA26, and GPBPpepl, or peptide fragments thereof. 

12. An antibody that selectively binds to the substantially purified protein or 
polypeptide of any one of claims 9-11. 

13. The antibody of claim 12, wherein the antibody is a polyclonal antibody. 

14. The antibody of claim 12, wherein the antibody is a monoclonal antibody. 

15. A method for detecting the presence of a protein that is substantially similar to a 
protein selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
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NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO:12, SEQ ID NO: 14, SEQ ID NO:16, 
SEQ ID NO: 1 8, SEQ ED NO:20, SEQ ID NO:22, SEQ ID NO:24, comprising 
a) providing a protein sample to be screened; 

h) contacting the protein sample to be screened with the antibody of any 
5 one of claims 12-14 under conditions that promote antibody-antigen complex 
formation; and 

c) detecting the formation of antibody-antigen complexes, wherein the 
presence of the antibody-antigen complex indicates the presence of a protein that is 
substantially similar to a protein selected from the group consisting of SEQ ED NO:2, 
10 SEQ) ID NO:4, SEQ ID NO:6, SEQ ID NO:S, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO:14. SEQ ID NO: 16, SEQ ID NO:lS. SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:24 

l<>. The method of claim 15, wherein detecting comprises a method selected from 
15 the group consisting of immunolocalization, immunofluorescence analysis, Western 
blot analysis, ELISAs, and nucleic acid expression library screening. 

17. A method for detecting in a sample a sequence that is substantially similar to a 
nucleic acid selected from the group consisting of SEQ ID NO:l, SEQ ED NO:3, SEQ 

20 ID N< ):5. SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l i, SEQ ID NO:13, SEQ ID NO:15, 
SEQ ID NO: 17. SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, or SEQ ID NO:25, 
comprising contacting the sample with the isolated nucleic acid of any one of claims 1- 
5. or fragments thereof, and detecting complex formation, wherein complex formation 
indicates the presence in the sample of the sequence that is substantially similar to a 

25 nucleic acid selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID N( >:5. SliQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:15, 
SEQ ID NO: 1 7, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, or SEQ ID NO:25. 

IS. The method of claim 17, wherein the detecting is carried out by a method 
30 selected from the group consisting of hybridization, reverse transcription, PCR, coupled 
reverse transcription-PCR, Northern blotting, Southern blotting, and DNA library 
screening. 
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19. A method for detecting an autoimmune condition in a patient, comprising 

-providing a tissue or body fluid sample from the patient; 

-providing a control tissue or body fluid sample in which no autoimmune 
5 condition is present; and 

-detecting altered GPBP RNA or protein expression in the tissue or body fluid 
sample compared to the control sample, wherein an alteration in GPBP RNA or protein 
expression relative to the control indicates the presence of an autoimmune condition. 

10 20. A method for detecting ceils undergoing apoptosis or cancer transformation in a 
tissue or body fluid sample, comprising 

-providing a tissue or body fluid sample from the patient; 
-providing a normal control tissue or body fluid sample; and 
-detecting altered GPBP RNA or protein expression in the tissue or body fluid 
1 5 sample compared to the control sample, wherein an alteration in GPBP RNA or protein 
expression relative to the control indicates the presence of cells undergoing apoptosis or 
cancer transformation. 

21. A method for treating a patient with an autoimmune disorder, comprising 
20 modifying the expression or activity of GPBP, GPBPA26, or a protein comprising a 

polypeptide substantially similarly to GPBPpepl in the patient with the autoimmune 
disorder. 

22. A method for treating a patient with a tumor, comprising modifying the 
25 expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 

substantially similarly to GPBPpepl in the patient with the tumor. 

23. A method for preventing cell apoptosis, comprising modifying the expression or 
activity of GPBP, GPBPA26, or a protein comprising a polypeptide substantially 

30 similarly to GPBPpepl in the cell. 
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24. The method of claim 21, 22, or 23 wherein alternative products of the 
Goodpasture antigen or of the myelin basic protein are used to modify the expression or 
activity of GPBP, GPBPA26 or a protein comprising a polypeptide substantially 
similarly to GPBPpepl . 

5 

25. The method of claim 21, 22, or 23 wherein nucleic acids comprising sequences 
substantially similar to SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO: 
5 1, or SEQ ID NO:53 or fragments thereof are used to modify the expression or activity 
of GPBP, GPBPA26 or a protein comprising a polypeptide substantially similarly to 

10 GPBPpepl. 

26. The method of claim 21, 22, or 23 wherein polypeptides comprising sequences 
substantially similar to SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID 
NO:4S, SEQ ID NO:50, or SEQ ID NO:54, or fragments thereof are used to modify the r . 

15 expression or activity of GPBP, GPBPA26 or a protein comprising a polypeptide 
substantially similarly to GPBPpepl . 

27. An isolated nucleic acid sequence comprising a sequence that encodes a 
polypeptide substantially similar to an amino acid sequence selected from the group 

20 consisting of SEQ ID NO:43, SEQ ID NO:44, or peptide fragments thereof 

28. An isolated nucleic acid sequence comprising a sequence that encodes a 
polypeptide selected from the group consisting of SEQ ID NO:43, SEQ ED NO:44, and 
peptide fragments thereof. 

25 

29. A recombinant expression vector comprising the isolated nucleic acid sequence 
of claim 27 or 28. 

30. A host cell transfected with the recombinant expression vector of claim 29. 

30 
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31. A substantially purified polypeptide, comprising an amino acid sequence 
substantially similar to a sequence selected from the group consisting of SEQ ID 
NO:43, SEQ ID NO:44, or peptide fragments thereof 

5 32. A substantially purified polypeptide, comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:43, SEQ ID NO:44, or peptide 
fragments thereof. 

33. An antibody that selectively binds to the substantially purified protein or 
10 polypeptide of claim 31 or 32. 

34. The antibody of claim 33, wherein the antibody is a polyclonal antibody. 

35. The antibody of claim 33, wherein the antibody is a monoclonal antibody. 

15 

30. The method of claim 21, 22, or 23 comprising administering a substantially 
purified polypeptide substantially similar to a polypeptide selected from the group 
consisting of SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ 
ID NO:5(>, or SEQ ID NO:54, or fragments thereof, to modify the expression or activity 
20 of GPBI\ GPBPA26, or a protein comprising a polypeptide substantially similarly to 
GPBPpepl. 

37. The method of claim 21, 22, or 23 comprising administering an isolated nucleic 
acid comprising sequences substantially similar to SEQ ID NO:45, SEQ ID NO:47, 

25 SEQ ID NO:49, SEQ ID NO: 51, or SEQ ID NO:53 or fragments thereof, or fragments 
thereof, to modify the expression or activity of GPBP, GPBPA26, or a protein 
comprising a polypeptide substantially similarly to GPBPpepl. 

38. A pharmaceutical composition, comprising an amount effective of a 
30 substantially purified polypeptide substantially similar to a polypeptide selected from 

the group consisting of SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:46, SEQ ID 
NO:4S, SEQ ID NO:50, or SEQ ID NO:54, or fragments thereof, to modify the 
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expression or activity of GPBP, GPBPA26, or a protein comprising a polypeptide 
substantially similarly to GPBPpepl , and a pharmaceutically acceptable carrier. 

39. A pharmaceutical composition, comprising an amount effective of a an isolated 
nucleic acid comprising sequences substantially similar to SEQ ID NO:45, SEQ ID 
NO:47, SEQ ID NO:49, SEQ ID NO: 51, or SEQ ID NO:53 or fragments thereof, to 
modify the expression or activity of GPBP, GPBPA26, or a protein comprising a 
polypeptide substantially similarly to GPBPpepl, and a pharmaceutically acceptable 
carrier. 

40. The method of claim 21, 22, or 23 comprising administering the pharmaceutical 
composition of claim 38 or 39 to modify the . expression or activity of GPBP, 
GPBPA26, or a protein comprising a polypeptide substantially similarly to GPBPpepl. 
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GCAGGAAGATGGCGGCGGTAGCGGAGGTGTGAGTGGACGCGGGACTCAGCGGCCGGATTTTCTCTTCCCT 7 0 

TCTTTTCCCTTTTCCTTCCCTATTTGAAATTGGCATCGAGGGGGCTAAGTTCGGGTGGCAGCGCCGGGCG 14 0 

CAACGCAGGGGTCACGGCGACGGCGGCGGCGGCTGACGGCTGGAAGGGTAGGCTTCATTCACCGCTCGTC 210 

CTCCTTCCTCGCTCCGCTCGGTGTCAGGCGCGGCGGCGGCGCGGCGGGCGGACTTCGTCCCTCCTCCTGC 2 3 0 

TCCCCCCCACACCGGAGCGGGCACTCTTCGCTTCGCCATCCCCCGACCCTTCACCCCGAGGACTGGGCGC 3 5 0 

CTCCTCCGGCGCAGCTGAGGGAGCGGGGGCCGGTCTCCTGCTCGGTTGTCGAGCCTCCATGTCGGATAAT 4 2 0 

M S p N 4 

CAGAGCTGGAACTCGTCGGGCTCGGAGGAGGATCCAGAGA CGGAGTCTGGGCCGCCTGTGGAGCGCTGCG 4 90 

Q... W__N._S_..jS_G__S_ 27 

GGGTCCTCAGTAAGTGGACAAACTACATTCATGGGTGGCAGGATCGTTGGGTAGTTTTGAAAAATAATGC 5 50 

G V .Xi S K W T N Y I H G W .Q....JP. ^ W Y_..„Y ?f K Sl_.-JH A 51 

TCTGAG TTA CTACAAA TCTGAAGA TGAAACAGAGTA TGGCTGCAGAGGA TCCA TCTGTCXTAGCAAGGCT 63 0 

L S Y Y K „ .S E D E T E Y ...!?... .G .._ .R G S X C . S K_ A 74 

GTCA TCA CA C C TCA C GA TTTTGA TGAA TGTCGA TTTGATATTAGTGTAAATGATAGTGTTTGGTA TCTTC 7 0 0 

GTG CTCAGGA* TCCAGA TCATAGACAGCAATGGATAGATGCCATTGAACAGCACAAGACTGAATCTGGA TA 77 0 

5... Jk...Jft. R....Q.....Q.....W ?....A....?.....E....Q„.._H __K_,_,T E S G Y 121 

TGGATCTGAATCCAGCTTGCGTCGACATGGCTCAATGGTGTCCCTGGTGTCTGGAGCAAGTGGCTACTCT 8 4 0 

GSESSLRRHG SMVSLVSGASGYS 144 

GCAA CATCCACCTCTTCA TTCAAGAAAGGC CACAGTTTAC GTGAGAAGTTG GCTG AAATGGAAAC ATTTA 910 

ATSTSSFK KGH S LR E K L A EM ET F 167 

GAGACATCTTATGTAGACAAGTTGACACGCTAGAGAAGTACTTTGATGCCTGTGCTGATGCTGTCTCTAA 9 8 0 

RD I LCRQVDTLQKYFDACADAVS K 191 

GGATGAACTTCAAAGGGATAAAGTGGTAGAAGATGATGAAGATGACTTTCCTACAACGCGTTCTGATGGT 10 5 0 

DELQRDKVVEDDEDDFPTTRSDG 214 

G ACTTCTTG CATAGTAC CAACGGCAATAAAGAAAAGTTATTTCCACATGTGAC ACCAAAAGG AATTAATG 112 0 

DFLHSTNGNKEKLFPHVTPKGIN 237 

GTATAGACTTTAAAGGGGAAGCGATAACTTTTAAAGCAACTACTGCTGGAATCCTTGCAACACTTTCTCA 1190 

GI D FKGEAITFKATTAG I LATLSH 261 

TTGT ATTG AACT AATGGTTAAACGTGAGGACAGCTGGCAGAAGAGACTGGATAAGG AAACTGAGAAGAAA 12 60 

C I ELMVKRED SWQKRLPKETEKK 284 

AGAAGAACAGAGGAAGCATATAAAAATGCAATGACAGAACTTAAGAAAAAATCCCACTTTGGAGGACCAG 13 3 0 
R R T E B A YKNAMTSL K K K S HF G G P 307 

ATTATG AAGAAGGC C CTAACAGTCTGATTAATGAAGAAGAGTT CTTTGATG CTGTTG AAGCTG CTCTTGA 14 0 0 
DY E EG'PNSL I NEEE FFDAVEAALD 331 
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CAGACAAGATAAAATAGAAGAACAGTCACAGAGTGAAAAGGTGAGATTACATTGGCCTACATCCTTGCCC 14 7 0 
RQDKI2SQSQSEKVRLHWPTSLP 354 

TCTGGAGATGCCTTTTCTTCTGTGGGGACACATAGATTTGTCCAAAAGCCCTATAGTCGCTCTTCCTCCA 154 0 
SGDAFSSVGTHRFVQK P Y S R S S S 377 

TGTCTTCCATTGATCTAGTCAGTGCCTCTGATGATGTTCACAGATTCAGCTCCCAGGTTGAAGAGATGGT 1610 
MSSIDLVSASDDVHRFSSO V E E M V 401 

GCAGAACCACATGACTTACTCATTACAGGATGTAGGCGGAGATGCCAATTGGCAGTTGGTTGTAGAAGAA 168 0 
QNHMTYSLQDVGGDANWQLVVEE 424 

GG AGAAATG AAGGT ATACAG AAGAG AAGT AGAAGAAAATGGGATTGTTCTGGATC CTTTAAAAGCTACC C 17 5 0 
GEMKVYRREVEENGIVLDPLKAT 447 

ATGCAGTTAAAGGCGTCACAGGACATGAAGTCTGCAATTATTTCTGGAATGTTGACGTTCGCAATGACTG 18 2 0 
HAVKGVTGHE VCNYFWNVDVRNDW 471 

GGAAACAACTATAGAAAACTTTCATGTGGTGGAAACATTAGCTGATAATGCAATCATCATTTATCAAACA 18 90 
ETTI ENFKVVETLADNAII IYQT 494 

CACAAGAGGGTGTGGCCTGCTTCTCAGCGAGACGTATTATATCTTTCTGTCATTCGAAAGATACCAGCCT 196 0 
H K R V W ? A S Q R D V L Y I, S V I R K I ? A 517 

TGACTGAAAATGACCCTGAAACTTGGATAGTTTGTAATTTTTCTGTGGATCATGACAGTGCTCCTCTAAA 2 0 3 0 
LTE ND PETW I VCNFSVDHDSAP L N r 541 

CAACCGATGTGTCCGTGCCAAAATAAATGTTGCTATGATTTGTCAAACCTTGGTAAGCCCACCAGAGGGA 210 0 
NRCVRAKItfVAMICQTLVSPPEG 564 

AACCAGGAAATT AGCAGGGACAAC ATTCTATGCAAGATTACATATGTAGCTAATGTGAACCCTGGAGGAT 217 0 
NQEI S RDM I LCKITYVANVNPGG 587 

GGGCACCAGCCTCAGTGTTAAGGGCAGTGGCAAAGCGAGAGTATCCTAAATTTCTAAAACGTTTTACTTC 2 2 4 0 
WAPASVLRAVAKREYPKFLKRFTS 611 

TTACGTCCAAGAAAAAACTGCAGGAAAGCCTATTTTGTTCTAGTATTAACAGGTACTAGAAGATATGTTT 2 310 
YVQEKTAGKPIL? 624 

TATCTTTTTTTAACTTTATTTGACTAATATGACTGTCAATACTAAAATTTAGTTGTTGAAAGTATTTACT 2 3 80 
ATGTTTTTT 2 3 8 9 
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SEQUENCE LISTING 

<110> Saus, Juan 

<120> Goodpasture Binding Protein 

<130> 98-723-B 

<14C> To Be Assigned 
<141> Filed Herewith 

<160> 54 

<170> Patentln Ver. 2.0 

<210> 1 
<2il> 2389 

<2i2> d:;a 

<213> Homo sapiens 

< 2 2 C > 
<221> CDS 

•:221 409> . . (2280) 
<400> : 

gcaggaaaat ggcggcggta gcggaggtgt gagtggacgc gggactcagc ggccggattt 60 

tctcctercc tcttttccct tttccttccc tatttgaaat tggcatcgag ggggctaagt 120 

tcgggcggca gcgccgggcg caacgcaggg gtcacggcga cggcggcggc ggctgacggc 18 0 

tggaagggca ggcttcattc accgctcgcc ccccttcctc gctccgctcg gtgtcaggcg 240 

cggcggcggc gcggcgggcg gacttcgtcc ctcctcctgc tcccccccac accggagcgg 300 

gcacccctrg cttcgccatc ccccgacccr ccaccccgag gactgggcgc ctcctccggc 360 

gcagccgagg gagcgggggc cggtctcctg ctcggttgtc gagcctcc atg teg gat 417 

Met Ser Asp 
1 

aat cag age tgg aac teg teg ggc teg gag gag gat cca gag acg gag 465 
Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
5 10 15 

tec ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg aca aac 513 
Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
20 25 30 35 

tac act cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat aat get 561 
Tyr lit- K:s Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
40 45 50 

ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc aga gga 609 
Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
55 60 65 

tec ate cgt ctt age aag get gtc ate aca cct cac gat ttt gat gaa 657 
Ser I-«r Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
70 75 80 
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tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt cgt get 
Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
85 90 95 



705 



cag gat cca gat cat aga cag caa tgg ata gat gec att gaa cag cac 753 
Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
100 105 . 110 115 

aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga cat ggc 801 
Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
120 125 130 

tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca aca tec 84 9 
Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
135 140 145 

ace tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg get gaa 897 
Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
150 155 160 

atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg eta cag 945 
Met Glu Thr Phe Arg Asp. He Leu Cys Arg Gin Val Asp Thr Leu Gin 
165 170 175 

aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa ctt caa 
Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
180 185 190 195 

agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt 
Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
200 205 210 

tct gat ggt gac ttc ttg cat agt ace aac ggc aat aaa gaa aag tta 
Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
215 220 225 

ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt aaa ggg 
Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe Lys Gly 
230 235 240 

gaa gcg ata act ttt aaa gca act act get gga ate ctt gca aca ctt 1185 
Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu 
245 250 255 

tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg cag aag 1233 
Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
260 265 270 275 

aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa gca tat 1281 
Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr 
280 285 290 



993 



1041 



1089 



1137 



aaa aat gca atg aca gaa ctt aag aaa aaa tec cac ttt gga gga cca 
Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro 
295 300 305 

gat tat gaa gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt 
Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe 
310 315 320 

gat get gtt gaa get get ctt gac aga caa gat aaa ata gaa gaa cag 



1329 



1377 



1425 
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Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin 
325 330 335 

tea cag agt gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct 1473 
Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser 
340 345 350 355 

gga gat gec ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc 1521 
Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro 
360 365 370 

tat agt cgc tct tec tec atg tct tec att gat eta gtc agt gee tct 1569 
Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala Ser 
375 380 385 

gat gat gtt cac aga ttc age tec cag gtt gaa gag atg gtg cag aac 1617 
Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn 
390 395 400 

cac atg act tac tea tta cag gat gta ggc gga gat gee aat tgg cag 1665 
His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin 
405 410 415 

ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa 1713 
Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu 
420 425 430 " 435 

gaa aat ggg att gtt ctg gat cct tta aaa get acc cat gca gtt aaa 1761 
Glu Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys 
440 445 450 

ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt 1809 
Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val 
455 460 * 465 

cgc aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa aca 1857 
Arg Asn Asp Trp Glu Thr Thr lie Glu Asn Phe His Val Val Glu Thr 
470 475 480 

tta get gat aat gca ate ate att tat caa aca cac aag agg gtg tgg 1905 
Leu Ala Asp Asn Ala lie lie lie Tyr Gin Thr His Lys Arg Val Trp 
485 490 495 

cct get tct cag cga gac gta tta tat ctt tct gtc att cga aag ata 1953 
Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val lie Arg Lys lie 
500 505 510 515 

cca gee ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt 2001 
Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe 
520 525 530 

tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc cgt gee 2049 
Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala 
535 540 545 

aaa ata aat gtt get atg att tgt caa acc ttg gta age cca cca gag 2097 
Lys lie Asn Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu 
550 555 560 

gga aac cag gaa att age agg gac aac att eta tgc aag att aca tat 2145 
Gly Asn Gin Glu lie Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr 
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565 570 575 

gta get aat gtg aac cct gga gga tgg gca cca gec tea gtg tta agg 2193 
Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg 
530 585 590 595 

gca gtg gca aag cga gag tat, cct aaa ttt eta aaa cgt ttt act tct 2241 
Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser 
600 605 . 610 

tac gtc caa gaa aaa act gca gga aag cct att ttg ttc tagtattaac 2290 
Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
615 620 

aggtactaga agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat 2350 

actaaaattt agttcttgaa agtatttact atgtttttt i389 



<210> 2 
< 2 1 i 624 
<212> PKT 

<2H> Homo sapiens 
< 4 cc - : 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Tnr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Tnr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
3 5 4 0 4 5 

Asn Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cvs Arc: civ Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Aso Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 12 0 125 

Arg Kic Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 13 5 14 0 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Aia Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 
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Glu Leu Gin Arg 
195 

Thr Thr Arg Ser 
210 

Glu Lys Leu Phe 
225 

Phe Lys Gly Glu 



Ala Thr Leu Ser 
260 

Trp Gin Lys Arg 
275 

Glu Ala Tyr Lys 
290 

Gly Gly Pro Asp 
305 

Glu Phe Phe Asp 



Glu Glu Gin Ser 
340 

Leu Pro Ser Gly 
355 

Gin Lys Pro Tyr 
37 0 

Ser Ala Ser Asp 
385 

Val Gin Asn His 



Asn Trp Gin Leu 
420 

Glu Val Glu Glu 
435 

Ala Val Lys Gly 
450 

Val Asp Val Arg 
465 

Val Glu Thr Leu 



Arg Val Trp Pro 
500 

Arg Lys lie Pro 



Asp Lys Val Val 
200 

Asp Gly Asp Phe 
215 

Pro His Val Thr 
2 3 0 

Ala He Thr Phe 
245 

His Cys He Glu 



Leu Asp Lys Glu 
280 

Asn Ala Met Thr 
295 

Tyr Glu Glu Gly 
310 

Ala Val Glu Ala 
325 

Gin Ser Glu Lys 



Asp Ala Phe Ser 
360 

Ser Arg Ser Ser 
375 

Asp Val His Arg 
390 

Met Thr Tyr Ser 
405 

Val Val Glu Glu 



Asn Gly He Val 

440 

Val Thr Gly His 
455 

Asn Asp Trp Glu 
470 

Ala Asp Asn Ala 
485 

Ala Ser Gin Arg 



Ala Leu Thr Glu 



Glu Asp Asp Glu 



Leu His Ser Thr 
220 

Pro Lys Gly He 
235 

Lys Ala Thr Thr 
250 

Leu Met Val Lys 
265 

Thr Glu Lys Lys 



Glu Leu Lys Lys 
300 

Pro Asn Ser Leu 
315 

Ala Leu Asp Arg 
330 

Val Arg Leu His 
345 

Ser Val Gly Thr 



Ser Met Ser Ser 
380 

Phe Ser Ser Gin 
395 

Leu Gin Asp Val 
410 

Gly Glu Met Lys 
425 

Leu Asp Pro Leu 



Glu Val Cys Asn 
460 

Thr Thr He Glu 
475 

He He He Tyr 
490 

Asp Val Leu Tyr 
505 

Asn Asp Pro Glu 



Asp Asp Phe Pro 
205 

Asn Gly Asn Lys 



Asn Gly He Asp 
240 

Ala Gly He Leu 
255 

Arg Glu Asp Ser 
270 

Arg Arg Thr Glu 
285 

Lys Ser His Phe 



He Asn Glu Glu 
320 

Gin Asp Lys He 
335 

Trp Pro Thr Ser 
350 

His Arg Phe Val 
365 

He Asp Leu Val 



Val Glu Glu Met 
400 

Gly Gly Asp Ala 
415 

Val Tyr Arg Arg 
430 

Lys Ala Thr His 
445 

Tyr Phe Trp Asn 



Asn Phe His Val 
480 

Gin Thr His Lys 
495 

Leu Ser Val He 
510 

Thr Trp He Val 
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515 520 

Cys Asn Phe Ser Val Asp His Asp 
530 535 

Val Arg Ala Lys lie Asn Val Ala 
545 ' 550 

Pro Pro Glu Gly Asn Gin Glu lie 
565 

He Thr Tyr Val Ala Asn Val Asn 
580 

Val Leu Arg Ala Val Ala Lys Arg 
595 600 

Phe Thr Ser Tyr Val Gin Glu Lys 
610 615 



525 

Ser Ala Pro Leu Asn Asn Arg Cys 
540 

Met He Cys Gin Thr Leu Val Ser 
555 560 

Ser Arg Asp Asn He Leu Cys Lys 
570 575 

Pro Gly Gly Trp Ala Pro Ala Ser 
585 590 

Glu Tyr Pro Lys Phe Leu Lys Arg 
605 

Thr Ala Gly . Lys Pro He Leu Phe 
620 



<:io> 3 

■: 2 1 1 > 2762 

.H2> DNA 

«.-I3> Mus musculus 



ZZ0> 

:::> cos 



. Z2Z> <444) . . (2315) 
» 4 C 0 > 3 

cgggccacca cgtgtaaata gtatcggacc cggcaggaag atggcggctg tagcggaggt 60 

gtgagtgagt ggatctgggt ctctgccgtt ggcttggctc ttcccgtctt cctcccctcc 120 

tccctccctg actgaggttg gcatctaggg ggccgagttc aggtggcggc gccgggcgca 180 

qr^caggggt cacggccacg gcggctgacg gctggaaggg caggctttct tcgccgctcg 240 

::c:cc::cc ccggtccgct cggtgtcagg cgcggcggcg gcggcgcggc gggcgcgctt 300 

cq:ccc:c:t cctgttccct cactccccgg agcgggctct cttggcggtg ccatcccccg 360 

acccttcacc ccagggacta ggcgcctgca ctggcgcagc tcgcggagcg ggggccggtc 420 

tcctgcccgg ctgtcgcgtc tec atg teg gat aac cag age tgg aac teg teg 473 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser 
15 10 

gge teg gag gag gat ccg gag acg gag tec ggg ccg cct gtg gag cgc 521 
Ciy Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val Glu Arg 
15 20 25 

tgc ggg gtc etc age aag tgg aca aac tat att cat gga tgg cag gat 569 
Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr He His Gly Trp Gin Asp 

"30 35 40 

cgt tgg gta gtt ttg aaa aat aat act ttg agt tac tac aaa tct gaa 617 
Arg Trp Val Val Leu Lys Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu 
"45 " 50 55 
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gat gaa aca gaa tat ggc tgt agg gga tec ate tgt ctt age aag get 665 
Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser lie Cys Leu Ser Lys Ala 
60 65 70 

gtg ate acg cct cac gat ttt gat gaa tgc egg ttt gat ate agt gta 713 
Val lie Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp lie Ser Val 
75 80 85 90 

aat gat act gtt tgg tac ctt cga get cag gac ccg gag cac aga cag 761 
Asn A.sp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Glu His Arg Gin 
95 100 105 

caa tgg gta gac gec att gaa cag cac aag act gaa teg gga tat gga 809 
Gin Trp Val Asp Ala lie Glu Gin His Lys Thr Glu Ser Gly Tyr Gly 
110 115 120 

tct gag tec age ttg cgt aga cat ggc tea atg gtg tea ctg gtg tct 857 
Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu Val Ser 
125 130 135 

gga gcg agt ggc tat tct get acg tec ace tct tct ttc aag aaa ggc 905 
Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly 
140 ~145 150 

cac agt tea cgt gag aaa ctg get gaa atg gag aca ttt egg gac ate 953 
His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg Asp lie 
155 160 165 170 

ccg tgc egg cag gtt gat act etc cag aag tac ttt gat gtc tgt get 1001 
Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala 
175 180 185 

gac get gtc tec aag gat gag ctt cag agg gat aaa gtc gta gaa gat 1049 
Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val Glu Asp 
190 195 200 

gat gaa gat gac ttc cct aca act cgt tct gat gga gac ttt ttg cac 1097 
Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe Leu His 
205 210 215 

aat ace aat ggt aat aaa gaa aaa tta ttt cea cat gta aca cca aaa 1145 
Asn Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr Pro Lys 
220 225 230 

gga att aat ggc ata gac ttt aaa ggg gaa gca ata act ttt aaa gca 1193 
Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala lie Thr Phe Lys Ala 
235 240 245 250 

act act get gga ate ctt get aca ctt tct cat tgt att gaa tta atg 1241 
Thr Thr Ala Gly lie Leu Ala Thr . Leu Ser His Cys lie Glu Leu Met 
255 260 265 

gta aaa egg gaa gag age tgg caa aaa aga cac gat agg gaa gtg gaa 12 8 9 
Val Lys Arg Glu Glu Ser Trp Gin Lys Arg His Asp Arg Glu Val Glu 
270 275 280 

aag agg aga cga gtg gag gaa gcg tac aag aat gtg atg gaa gaa ctt 13 3 7 
Lys Arg Arg Arg Val Glu Glu Ala Tyr Lys Asn Val Met Glu Glu Leu 
285 290 295 

aag aag aaa ccc cgt ttc gga ggg ccg gat tat gaa gaa ggt cca aac 1385 
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Lys Lys Lys Pro Arg Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
300 305 310 

agt ctg att aat gag gaa gag ttc ttt gat get gtt gaa get get ctt 1433 
Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
315 320 325 330 

gac aga caa gat aaa ata gag gaa cag tea cag agt gaa aag gtc agg 14 81 
Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
335 340 345 

tta cac tgg ccc aca tea ttg cca tet gga gac acc ttt tct tct gtc 1529 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Thr Phe Ser Ser Val 
350 355 360 

ggg acg cat aga ttt gta caa aag ccc tat agt ccc tct tec tec atg 1577 
Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met 
365 370 375 

tct tec att gat eta gtc agt gec tct gac gat gtt cac aga ttc age 1625 
Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His Arg Phe Ser 
380 .385 390 

tec cag gtt gaa gaa atg gta cag aac cac atg aac tat tea tta cag 1673 
Ser Gin Val Glu Glu Met Val Gin Asn His Met Asn Tyr Ser Leu Gin 
395 400 405 410 

gat gta ggt ggt gat gca aat tgg caa ctg gtt gtt gaa gaa gga gaa 1721 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
415 420 425 

atg aag gta tac aga aga gaa gtg gaa gaa aat gga att gtt ctg gat 1769 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
430 435 440 

cct ttg aaa get act cat gca gtt aaa ggt gtt aca gga cat gag gtc 1817 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
445 450 455 

tgc aat tac ttt tgg aat gtt gat gtt cgc aat gac tgg gaa act act 1865 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
460 465 470 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1913 
Tie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie He 
475 480 485 490 



gtt tat caa acg cac aag aga gta tgg ccc get tct cag aga gac gta 
Val Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
495 500 505 



cct gaa act tgg ata gtt tgt aat ttt tct gtg gat cat gat agt get 
Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
525 530 535 

cct ctg aac aat cga tgt gtc cgt gec aaa ate aat att get atg att 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn He Ala Met He 



1961 



ctg tat ctt tct get att cga aag ate cca gec ttg act gaa aat gat 2009 
Leu Tyr Leu Ser Ala He Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
510 515 520 



2057 



2105 
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540 545 550 

tgt caa ace tta gta age cca cca gag gga gac cag gag ata age aga 2153 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asp Gin Glu He Ser Arg 
555 560 565 570 

gac aac att ctg tgc aag atcacg tat gta get aat gtg aac cca gga 2201 
Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
575 580 585 

gga tgg gcg cca get teg gtc tta aga gca gtg gca aag cga gaa tac 2249 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
590 S95 600 

cct aag ttt eta aaa cgt ttt act tct tat gtc caa gaa aaa act gca 2297 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
605 610 615 

gga aaa cca att ttg ttt tagtattaac agtgactgaa geaaggctge 2345 
Gly Lys Pro He Leu Phe 
620 



gtgacgttcc 


atgttggaga 


aaggagggaa 


aaaataaaaa 


gaatcctcta 


agctggaacg 


2405 


taggatctac 


agecttgtet 


gtggcccaag 


aagaaacatt 


geaategtaa 


agctgggtat 


2465 


ccagcactag 


ccatctcctg 


ctaggcctcc 


tcgctcagcg 


tgtaactata 


aatacatgta 


2525 


gaatcacatg 


gatatggcta 


tatttttatt 


tgcttgctcc 


ttggagtgaa 


aacaaataac 


2585 


tttgaattac 


aactaggaat 


taaccgatgc 


tttaattttg 


aggaactttt 


tcagaatttt 


2645 


ttatttacca 


tggtccaacc 


taagatcctc 


agttgtatca 


agtttttgtg 


cacaaaagaa 


2705 


aagcacaaaa 


gttgaacgea 


cctgaaggca 


tgtgctctct 


gtgcaacaaa 


tactcag 


2762 



<210> 4 
<211> 624 
<212> PRT 

<213> Mus musculus 
<400> 4 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 " 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
'65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 



9 



WO 00/50607 



PCT/IB00/00324 



Leu Arg Ala Gin Asp Pro Glu His Arg Gin Gin Trp Val Asp Ala lie 
100 105 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly, Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 .220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 

Phe Lys Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu 
245 250 255 

Ala Thr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Glu Ser 
260 265 270 

Trp Gin Lys Arg His Asp Arg Glu Val Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Val Met Glu Glu Leu Lys Lys Lys Pro Arg Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu 
305 " 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 350 

Leu Pro Ser Gly Asp Thr Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 " 360 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val 
370 375 380 

Ser Ala Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
335 390 395 400 

Val Gin Asn His Met Asn Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
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420 425 430 

Glu Val Glu Glu Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val 
465 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala lie He Val Tyr Gin Thr His Lys 
485 490 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

Arg Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

Cys Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys He Asn He Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asp Gin Glu He Ser Arg Asp Asn lie Leu Cys Lys 
565 570 575 

He Thr Tyr Val A.la Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 5 
<211> 2361 
<212> DNA 
<213> Bos caurus 

<220> 
<221> CDS 

<222> (421) . . (2292) 
<400> 5 

cggcaggaag atggcggcct agcggaggtg tgagcggacc tgggtctccg cagctgggtt 60 

ttccctcttc ccgtctttct cctcttttcc tctcccccga ggttggcatc gagggggcca 120 

aattcgggcg gcggcgccgg gcgcagcgca ggggtcacaa cgacggcgac ggctgacggt 180 

tggaagggca ggcttccctc gcccctcgac ctccttcccc ggtccgcttg gtgtcaggcg 240 

cggcggcggc ggcggcggcg gcgcggcggg cggactccat ccctcctccc gctccctcct 300* 

gcaccggagc gggcactcct tccttcgcca tcccccgacc cttcaccccg gggactgggc 360 
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gcctccaccg gcgcagctca gggagcgggg gccggtctcc tgctcggctg tcgcgcctcc 420 

atg teg gat aac cag age tgg aac teg teg ggc teg gag gag gat ccg 463 
Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

gag acg gag tec ggg ccg ccg, gtg gag cgc tgc gga gtc etc aac aag 516 
Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

tgg aca aac tat att cat ggg tgg cag gat cgc tgg gta gtt ttg aaa 564 
Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

aat aac act ctg agt tac tac aaa tct gaa gat gag aca gag tat ggc 612 
Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

tgc aga gga tec ate tgt ctt age aag get gtc ate acg cct cat gat 660 
Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp 
65 70 7 5 8 0 

ttt gat gaa tgc cga ttt gat att agt gta aat gat agt gtt tgg tat 708 
Phe Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

ctt cgt get caa gat cca gat cac aga cag cag tgg ata gat gee att 756 
Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie 
100 105 110 

gaa cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt 804 
Glu Gin His Lvs Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

cga cat ggc tec atg gta tea ttg gta tec gga gca agt ggc tat tct 852 
Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

gca aca tec acc tec tea ttc aag aag ggc cac agt tta cgt gag aaa 
Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

ctg get gaa atg gaa acc ttt aga gat ata ctg tgt aga caa gtt gat 
Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 175 

acc eta cag aag ttc ttt gat gee tgt get gat get gtc tec aag gat 996 
Thr Leu Gin Lvs Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 



900 



948 



gaa ttt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct 
Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

acg aca cgt tct gat gga gac ttc ttg cat aat acc aat ggc aat aag 
Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 ' " 215 220 



1044 



1092 



gaa aag gta ttt cca cat gta aca cca aaa gga att aat ggt ata gac 1140 
Glu Lys Val Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 
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ttt aaa ggt 
Phe Lys Gly 



get aca ctt 

Ala Thr Leu 

tag caa aag 

Trp Gin Lys 
275 

gaa gca cac 

Giu Ala Tyr 
290 

gga gaa cca 

Gly Giy Pro 
203 

gag ::: t:r 

Gii- r-rte p n e 



gaa gaa cag 
G 1 u G » u Gin 



atg cca tct 
Met Pre ~-r 

3 55 

caa aag ccc 
Gin Lys Pro 



agt ac: t rt 
Ser A : <j i>r 

ses 

gtg cag aac 
Val Gin Asn 



aac tgg cag 
As n Trp O It. 

gaa gta aaa 
Glu Val Giu 
4 3S 

gca gtt aaa 
Ala Val Lys 
450 

gtt gat att 

Val Asr Val 
465 



gag gcg ata 
Glu Ala He 
245 

tec cat tgt 
Ser His Cys 
260 

aga atg gac 
Arg Met Asp 



aaa aat gec 
Lys Asn Ala 



gat tat gag 
Asp Tyr Glu 
310 

gat get get 
Asp Ala Val 
325 

teg cag agt 
Ser Gin Ser 
340 

gga gat gec 
Giy Asp Ala 



tat agt cgc 
Tyr Ser Arg 



gac ggt get 
Asp Gly Val 
390 

cac atg ace 
His Met Thr 
405 

t tg gtt gta 
Leu Val Val 
420 

gaa aat ggg 
Giu Asn Gly 



ggc gtt aca 
Gly Val Thr 



cgc aat gat 
Arg Asn Asp 
470 



act ttt aaa 
Thr Phe Lys 



att gag ctg 
He Glu Leu 
265 

aag gaa act 
Lys Glu Thr 
280 

atg aca gaa 
Met Thr Glu 

295 

gaa ggc cca 
Glu Gly Pro 



gaa get get 
Glu Ala Ala 



gaa aag gtc 
Glu Lys Val 
345 

ttt tct tct 
Phe Ser Ser 
360 

tct tec tec 
Ser Ser Ser 

375 

cac aga ttc 
His Arg Phe 



tat tea ttg 
Tyr Ser Leu 



gaa gaa ggg 
Glu Glu Gly 
425 

att gtt ctg 
He Val Leu 
440 

gga cac gag 
Gly His Glu 
455 

tgg gaa aca 
Trp Glu Thr 



gca act act 
Ala Thr Thr 
250 

atg gta aaa 
Met Val Lys 



gag aag aga 
Glu Lys Arg 



ctt aag aaa 
Leu Lys Lys 
300 

aac agt ttg 
Asn Ser Leu 
315 

ctt gac aga 
Leu Asp Arg 
330 

agg tta cat 

Arg Leu His 



gtg ggg act 

Val Gly Thr 



atg tct tec 
Met Ser Ser 
380 

age tec cag 
Ser Ser Gin 
395 

cag gat gta 
Gin Asp Val 
410 

gag atg aag 
Glu Met Lys 



gat cct ttg 
Asp Pro Leu 



gtc tgc aat 
Val Cys Asn 
460 

act ata gaa 
Thr He Giu 
475 



gee gga ate 
Ala Gly He 
255 

cgt gag gac 
Arg Glu Asp 
270 

aga aga gtg 

Arg Arg Val 

285 . 

aaa tec cac 

Lys Ser His 



att aat gaa 

He Asn Glu 



caa gat aaa 
Gin Asp Lys 
335 

tgg tct act 
Trp Ser Thr 
350 

cat aga ttt 
His Arg Phe 
365 

att gat eta 
He Asp Leu 



gtt gaa gag 
Val Glu Glu 



ggt ggg gac 
Gly Gly Asp 
415 

gta tat aga 
Val Tyr Arg 
430 

aaa get acc 

Lys Ala Thr 
445 

tac ttc tgg 

Tyr Phe Trp 



aac ttt cat 
Asn Phe His 



ctt 1188 
Leu 



age 1236 
Ser 



gag 1284 
Glu. 



ttt 1332 
Phe 



gag 1380 

Glu 

320 

ata 1428 
He 



tea 1476 
Ser 



gtc 1524 
Val 



gtc 1572 
Val 



atg 1620 

Met 

400 

gee 1668 
Ala 



aga 1716 
Arg 



cat 1764 
His 



aat 1812 
Asn 



gtg 1860 

Val 

480 
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gtg gaa aca tta get gat aat gca ate ate att tat caa acg cac aag 1908 
Val Glu Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys 
485 490 4S5 

aga gtg tgg cca gec tct cag egg gat gtc tta tat ctg tct gee att 1956 
Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 , 505 510 

cga aag ata cca get ttg aat gaa aat gac ccg gag act tgg ata gtt 2004 
Arg Lys He Pro Ala Leu Asn Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

tgt aat ttt tct gta gat cac age agt get cct eta aac aat cga tgt 2052 
Cys Asn Phe Ser Val Asp His Ser Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

gtc cgt gee aaa ata aac gtt get atg att tgt cag ace ttg gtg age 2100 
Val Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

ccc cca gag gga aac cag gag att age agg gac aac att eta tgc aag 214 8 
Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

att aca tac gtg gee aat gta aac cct gga gga tgg gee cca gee tea 2196 
He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

gtg tta egg gca gtg gca aag cga gaa tat cca aag ttt eta aag cgt 2244 
Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

ttt act tct tac gta caa gaa aaa act gca gga aaa cct att ttg ttc 2292 
Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 

tagtattaac agtgactgaa gcaaggctgt gtgacattcc atgttggagg aaaaaaaaaa 23 52 

aaaaaaaaa 2361 



<210> 6 
<211> 624 
<212> PRT 
<213> Bos taurus 

<400> 6 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val lie Thr Pro His Asp 
65 70 75 80 
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Phe Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie 
100 105 ' 110 

Glu Gin His Lys Thr Glu Ser *Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

TJ-.r Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
2 : 0 215 220 

Lys Val Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 

:: r - 230 235 240 

?rie L/z Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu 
245 250 ' 255 

A » a Tnr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Cln Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
Z75 280 285 

3iu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 " 300 

G.y Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu 
305 310 315 320 

Glu Pne Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie 
325 330 335 

Z L u Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val 
370 375 380 

Ser Ala Ser Asp Gly Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
385 390 395 400 
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Val Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
420 425 430 

Glu Val Glu Glu Asn Gly He, Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asd Trp Glu Thr Thr He Glu Asn Phe His Val 
465 ~ 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys 
485 490 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

Arg Lys He Pro Ala Leu Asn Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 _ 525 

Cvs Asn Phe Ser Val Asp His Ser Ser Ala- Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lvs He Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

Phe Thr Ser Tvr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 7 
<211> 2187 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (391) . . (2187) 
<400> 7 

tagcggaggt gtgagtggac gcgggactca gcggccggat tttctcttcc cttcttttcc 60 
cttttccttc cctatttgaa attggcatcg agggggctaa gttcgggtgg cagcgccggg 120 
cgcaacgcag gggtcacggc gacggcggcg gcggctgacg gctggaaggg taggcttcat 180 
tcaccgctcg tcctccttcc tcgctccgct cggtgtcagg cgcggcggcg gcgcggcggg 240 
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cggacttcgt ccctcctcct gctccccccc acaccggagc gggcactctt cgcttcgcca 300 

tcccccgacc cttcaccccg aggactgggc gcctcctccg gcgcagctga gggagcgggg 360 

gccggtctcc tgctcggttg tcgagcctcc atg teg gat aat cag age tgg aac 414 

Met Ser Asp Asn Gin Ser Trp Asn 
1 5 

teg teg ggc teg gag gag gat cca gag acg gag tct ggg ccg cct gtg 462 
Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val 
10 15 20 

gag cgc tgc ggg gtc etc agt aag tgg aca aac tac att cat ggg tgg 510 
Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr lie His Gly Trp 
25 30 35 40 

cag gat cgt tgg gta gtt ttg aaa aat aat get ctg agt tac tac aaa 558 
Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala Leu Ser Tyr Tyr Lys 
45 50 55 

tct gaa gat gaa aca gag tat ggc tgc aga gga tec ate tgt ctt age 606 
Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser lie Cvs Leu Ser 
60 65 70 

aag get gtc ate aca cct cac gat ttt gat gaa tgt cga ttt gat att 654 
Lys Ala Val lie Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp He 
75 80 85 

agt gta aat gat agt gtt tgg tat ctt cgt get cag gat cca gat cat 702 
Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Asp His 
90 95 100 

aga cag caa tgg ata gat gee att gaa cag cac aag act gaa tct gga 750 
Arg Gin Gin Trp He Asp Ala He Glu Gin His Lys Thr Glu Ser Gly 
105 110 115 120 

tat gga tct gaa tec age ttg cgt cga cat ggc tea atg gtg tec ctg 798 
Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu 
125 130 135 

gtg tct gga gca agt ggc tac tct gca aca tec ace tct tea ttc aag 846 
Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys 
140 145 150 

aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca ttt aga 894 
Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
155 160 165 

gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt gat gec 942 
Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
170 175 180 

tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa gtg gta 990 
Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
185 190 195 200 

gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggc gac ttc 1038 
Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
205 210 215 

ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat gtg aca 1086 
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Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
220 225 230 

cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata act ttt 1134 
Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala lie Thr Phe 
235 240 245 

aaa gca act: act get gga ate ctt gca aca ctt tct cat tgt att gaa 1182 
Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu 
250 255 260 

eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat aag gaa 123 0 
Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
265 270 275 280 

act gag aag aaa aga aga aca gag gaa gca tat aaa aat gca atg aca 1278 
Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala Met Thr 
285 290 295 

gaa ctt aag aaa aaa tec cac ttt gga gga cca gat tat gaa gaa ggc 132 6 
Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu Glu Gly 
300 305 310 

cct aac agt ctg att aat gaa gaa gag ttc ttt gat get gtt gaa get 1374 
Pro Asn Ser Leu lie Asn Glu Glu Glu Phe" Phe Asp Ala Val Glu Ala 
315 320 325 

art ctt gac aga caa gat aaa ata gaa gaa cag tea cag agt gaa aag 1422 
Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys 
330 335 340 



atg aaa tta cat tgg cct aca tec ttg ccc tct gga gat gee ttt tct 
Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser 
34S 350 355 360 



1470 



».:: gtg ggg aca cat aga ttt gtc caa aag gtt gaa gag atg gtg cag 1518 
5er val Glv Thr His Arg Phe Val Gin Lys Val Glu Glu Met Val Gin 
365 370 375 

aac cac atg act tac tea tta cag gat gta ggc gga gat gee aat tgg 1566 
Ai:n Hie yet Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp 
380 385 390 

cag ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta 1614 
Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val 
395 400 405 

gaa gaa aat ggg att gtt ctg gat cct tta aaa get acc cat gca gtt 
Glu Glu Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His Ala Val 
41C 415 420 

aaa ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac 
Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp 
425 430 435 440 

gtt cgc aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa 
Val Arg Asn Asp Trp Glu Thr Thr lie Glu Asn Phe His Val Val Glu 
445 450 455 

aca tta get gat aat gca ate ate att tat caa aca cac aag agg gtg 1806 
Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val 



1662 



1710 



1758 
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460 465 470 

tgg cct get tct cag cga gac gta tta tat ctt tct gtc att cga aag 1854 

Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys 

475 480 485 

ata cca gec ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat 1902 

He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn 

490 495 500 

ttt tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc cgt 1950 

Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg 

505 510 515 " 520 

gec aaa ata aat gtt get atg att tgt caa acc ttg gta age cca cca 1998 

Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro 
525 530 535 

gag gga aac cag gaa att age agg gac aac att eta tgc aag att aca 2 04 6 

Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cvs Lys He Thr 

540 545 ' 550 

tat gta get aat gtg aac ecu gga gga tgg gca cca gee tea gtg tta 2094 

Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu 

555 560 565 

agg gca gtg gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act 2142 

Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr 

570 575 580 

tct tac gtc caa gaa aaa act gca gga aag cct att ttg ttc tag 2187 

Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
585 590 * 595 



<210> 8 

<211> 598 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 ~ 3 0 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
3 5 4 0 4 5 

Asn Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 " 60 

Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 110 
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Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 * 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
13 0 13 S 14 0 

Ala Thr Ser Thr Ser Ser Phe* Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Mec Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 175 



Thr Leu Gin Lys 
180 

Glu Leu Gin Arg 
195 

Thr Thr Arg Ser 
210 

Glu Lys Leu Phe 
225 

Phe Lys Gly Glu 



Ala Thr Leu Ser 
260 

Trp Gin Lys Arg 
275 

Glu Ala Tyr Lys 
290 

Gly Gly Pro Asp 
305 

Glu Phe Phe Asp 



Glu Glu Gin Ser 
340 

Leu Pro Ser Gly 
355 

Gin Lys Val Glu 
370 

Asp Val Gly Gly 
385 

Met Lys Val Tyr 



Pre Leu Lys Ala 
420 



Tyr Phe Asp Ala 



Asp Lys Val Val 
200 

Asp Gly Asp Phe 
215 

Pro His Val Thr 
230 

Ala lie Thr Phe 
245 

His Cys lie Glu 



Leu Asp Lys Glu 
280 

Asn Ala Met Thr 
295 

Tyr Glu Glu Gly 
310 

Ala Val Glu Ala 
325 

Gin Ser Glu Lys 



Asp Ala Phe Ser 
360 

Glu Met Val Gin 
375 

Asp Ala Asn Trp 
390 

Arg Arg Glu Vai 
405 

Thr His Ala Val 



Cys Ala Asp Ala 
185 

Glu Asp Asp Glu 



Leu His Ser Thr 
220 

Pro Lys Gly lie 
235 

Lys Ala Thr Thr 
250 

Leu Met Val Lys 
265 

Thr Glu Lys Lys 



Glu Leu Lys Lys 
300 

Pro Asn Ser Leu 
315 

Ala Leu Asp Arg 
330 

Val Arg Leu His 
345 

Ser Val Gly Thr 



Asn His Met Thr 
380 

Gin Leu Val Val 
395 

Glu Glu Asn Gly 
410 

Lys Gly Val Thr 
425 



Val Ser Lys Asp 
190 

Asp Asp Phe Pro 
205 

Asn Gly Asn Lys 



Asn Gly lie Asp 
240 

Ala Gly lie Leu 
255 

Arg Glu Asp Ser 
270 

Arg Arg Thr Glu 
285 

Lys Ser His Phe 



lie Asn Glu Glu 
320 

Gin Asp Lys lie 
335 

Trp Pro Thr Ser 
350 

His Arg Phe Val 
365 

Tyr Ser Leu Gin 



Glu Glu Gly Glu 
400 

lie Val Leu Asp 
415 

Gly His Glu Val 
430 
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Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 ' 475 480 

Leu Tyr Leu Ser Val lie Arg Lys lie Pro Ala Leu Thr Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn Val Ala Met lie 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie Ser Arg 
530 535 A 540 

Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 " 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 ~ 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 '585 590 

Gly Lys Pro lie Leu Phe 
595 



<210> 9 
<211> 2684 
<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 

<222> (444) . . (2237) 
<400> 9 

cgggccacca cgtgtaaata gtatcggacc cggcaggaag atggcggctg tagcggaggt 60 

gtgagtgagt ggatctgggt ctctgccgtt ggcttggctc ttcccgtctt cctcccctcc 120 

cccctccctg actgaggttg gcatctaggg ggccgagttc aggtggcggc gccgggcgca 180 

gcgcaggggt cacggccacg gcggctgacg gctggaaggg caggctttct tcgccgctcg 240 

tcctccttcc ccggtccgct cggtgtcagg cgcggcggcg gcggcgcggc gggcgcgctt 300 

cgtccctctt cctgttcccc cactccccgg agcgggctct cttggcggtg ccatcccccg 360 

acccttcacc ccagggacta ggcgcctgca ctggcgcagc tcgcggagcg ggggccggtc 420 

tcctgctcgg ctgtcgcgtc tec atg teg gat aac cag age tgg aac teg teg 473 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser 
15 10 
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ggc teg gag gag gat ccg gag acg gag tec ggg ccg cct gtg gag cgc 521 
Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val Glu Arg 
15 20 25 



tgc ggg gtc etc age aag tgg aca aac tat att cat gga tgg cag gat 
Cys Gly Val Leu Ser Lys Trp^ Thr Asn Tyr lie His Gly Trp Gin Asp 
30 35 40 

cgt tgg gta gtt ttg aaa aat aat act ttg agt tac tac aaa tct gaa 
Arg Trp Val Val Leu Lys Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu 
45 50 55 

gat gaa aca gaa tat ggc tgt agg gga tec ate tgt ctt age aag get 
Asp Glu ,Thr Glu Tyr Gly Cys Arg Gly Ser lie Cys Leu Ser Lys Ala 
60 65 70 

gtg ate acg cct cac gat ttt gat gaa tgc egg ttt gat ate. agt gta 
Val lie Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp lie Ser Val 
75 80 85 90 



569 



617 



665 



713 



857 



905 



953 



aat gat agt gtt tgg tac ctt cga get cag gac ccg gag cac aga cag 761 
Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Glu His Arg Gin 
95 100 105 

caa tgg gta gac gee att gaa cag cac aag act gaa teg gga tat gga 809 
Gin Trp Val Asp Ala lie Glu Gin His Lys Thr Glu Ser Gly Tyr Gly 
110 115 120 

tct gag tec age ttg cgt aga cat ggc tea atg gtg tea ctg gtg tct 
Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu Val Ser 
125 130 135 

gga gcg agt ggc tat tct get acg tec ace tct tct ttc aag aaa ggc 
Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly 
140 145 150 

cac agt tea cgt gag aaa ctg get gaa atg gag aca ttt egg gac ate 
His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg Asp lie 
155 " 160 165 170 

ctg tgc egg cag gtt gat act etc cag aag tac ttt gat gtc tgt get 
Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala 
175 180 185 

gac get gtc tec aag gat gag ctt cag agg gat aaa gtc gta gaa gat 
Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val Glu Asp 
190 195 200 

gat gaa gat gac ttc cct aca act cgt tct gat gga gac ttt ttg cac 
Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe Leu His 
205 210 215 

aat ace aat ggt aat aaa gaa aaa tta ttt cca cat gta aca cca aaa 
Asn Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr Pro Lys 
220 225 230 

gga att aat ggc ata gac ttt aaa ggg gaa gca ata act ttt aaa gca 1193 
Gly lie Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe Lys Ala 
235 240 245 250 



1001 



1049 



1097 



1145 
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act act get gga ate ctt get aca ctt tct cat tgt att gaa tta atg 1241 
Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu Leu Met 
255 260 265 

gta aaa egg gaa gag age tgg caa aaa aga cac gat agg gaa gtg gaa 128 9 
Val Lys Arg Glu Glu Ser Trp Gin Lys Arg His Asp Arg Glu Val Glu 
270 275 280 

aag agg aga cga gtg gag gaa gcg tac aag aat gtg atg gaa gaa ctt 1337 
Lys Arg Arg Arg Val Glu Glu Ala Tyr Lys Asn Val Met Glu Glu Leu 
285 290 295 

aag aag aaa ccc cgt ttc gga ggg ccg gat tat gaa gaa ggt cca aac 13 85 
Lys Lys Lys Pro Arg Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
300 305 310 

agt ctg att aat gag gaa gag ttc ttt gat get gtt gaa get get ctt 1433 
Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
315 320 " 325 330 

gac aga caa gat aaa ata gag gaa cag tea cag agt gaa aag gtc agg 1481 
Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
335 * 340 345 

tta cac tgg ccc aca tea ctg cca tct gga gac acc ttt tct tct gtc 1529 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Thr Phe Ser Ser Val 
350 355 360 

ggg acg cat aga ttt gta caa aag gtt gaa gaa atg gta cag aac cac 1577 
Gly Thr His Arg Phe Val Gin Lys Val Glu Glu Met Val Gin Asn His 
365 370 375 

atg aac tat tea tta cag gat gta ggt ggt gat gca aat tgg caa ctg 1625 
Met Asn Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu 
380 385 390 

gtt gtt gaa gaa gga gaa atg aag gta tac aga aga gaa gtg gaa gaa 1673 
Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu 
395 400 405 410 

aat gga art gtt ctg gat cct ttg aaa get act cat gca gtt aaa ggt 1721 
Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly 
415 420 425 

gtt aca gga cat gag gtc tgc aat tac ttt tgg aat gtt gat gtt cgc 1769 
Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg 
430 435 440 

aat gac tgg gaa act act ata gaa aac ttt cat gtg gtg gaa aca tta 1817 
Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu 
445 450 455 

get gat aac gca ate ate gtt tat caa acg cac aag aga gta tgg ccc 1865 
Ala Asp Asn Ala He He Val Tyr Gin Thr His Lys Arg Val Trp Pro 
460 465 470 

get tct cag aga gac gta ctg tat ctt tct get att cga aag ate cca 1913 
Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He Arg Lys He Pro 
475 480 485 490 

gee ttg act gaa aat gat cct gaa act tgg ata gtt tgt aat ttt tct 1961 
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Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser 
495 500 505 

gtg gat cat gat agt get cct ctg aac aat cga tgt gtc cgt gec aaa 2009 
Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys 
510 515 520 

ate aat att get atg att tgt caa act tta gta age cca cca gag gga 2057 
Ilje Asn lie Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly 
525 530 535 

gac cag gag ata age aga gac aac att ctg tgc aag ate acg tat gta 2105 
Asp Gin Glu lie Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val 
540 545 550 

get aat gtg aac cca gga gga tgg gcg cca get teg gtc tta aga gca 2153 
Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala 
555 560 565 570 

gtg gca aag cga gaa tac cct aag ttt eta aaa cgt ttt act tct tat 2201 
Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr 
575 580 585 



gtc caa gaa aaa act gca gga aaa cca att ttg ttt tagtattaac 
Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
590 595 



2247 



agtgactgaa geaaggctge gtgacgttcc atgttggaga aaggagggaa aaaataaaaa 23 07 
gaatcctcta agctggaacg taggatctac agecttgtet gtggcccaag aagaaacatt 2367 
geaategtaa agctgggtat ccagcactag ccatctcctg ctaggcctcc tcgctcagcg 2427 
tgtaactata aatacatgta gaatcacatg gatatggcta tatttttatt tgcttgctcc 2487 
ttggagtgaa aacaaataac tttgaattac aactaggaat taaccgatgc tttaattttg 2547 
aggaactttt tcagaatttt ttatttacca tggtccaacc taagatcctc agttgtatca 2607 
agtttttgng cacaaaagaa aagcacaaaa gttgaacgea cctgaaggca tgtgctctct 2667 
gtgcaacaaa tactcag 2684 

<210> 10 

<211> 598 

<212> PRT 

<213> Mus musculus 

<400> 10 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 * 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
2 0 25 3 0 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 
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Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Glu His Arg Gin Gin Trp Val Asp Ala He 
100 105 " 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tvr Ser 
130 135 14 0 

Aia Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
H5 150 155 160 

Leu Ala Glu Men Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 * ~ 175 

Thr Leu Gir. Lys Tyr Phe Asp Val Cys Ala Asp Ala Val Ser Lys Asp 
180 185 * 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 2 00 2 05 

Thr Tr.r Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
21C 215 22 0 

Glu Ly.- Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly lie Leu 
245 250 255 

Ala Thr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Glu Ser 
260 265 270 

Trp Gin Lvs Arg His Asp Arg Glu Val Glu Lys Arg Arg Arg Val Glu 

280 285 

Glu Ala Tyr Lys Asn Val Met Glu Glu Leu Lys Lys Lys Pro Arg Phe 
29C 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe P~- Asp Aia Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 350 

Leu Pre Ser Gly Asp Thr Phe Ser Ser Val Gly Thr His Arg Phe Val 
11\ 360 ' 365 

Gin Lyr. *. a l G-u Glu Met Val Gin Asn His Met Asn Tvr Ser Leu Gin 
2~0 375 380 
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Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
405 410 415 

Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 
450 455 460 

Val Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Leu Tyr Leu Ser Ala He Arg Lys lie Pro Ala Leu Thr Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn He Ala Met He 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asp Gin Glu lie Ser Arg 
530 535 540 

Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 * 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro He Leu Phe 
595 



<210> 11 
<211> 2283 
<212> DNA 
<213> Bos taurus 

<220> 
<221> CDS 

<222> (421) . . (2214) 
<400> 11 

cggcaggaag atggcggcct agcggaggtg tgagtggacc tgggtctctg cagctgggtt 60 
ttccctcttc ccgtctttct cctcttttcc tctcccccga ggttggcatc gagggggcca 120 
aattcgggcg gcggcgccgg gcgcagcgca ggggtcacaa cgacggcgac ggctgacggt 180 
tggaagggca ggcttccttc gcccctcgac ctccttcccc ggtccgcttg gtgtcaggcg 240 
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cggcggcggc ggcggcggcg gcgcggcggg cggactccat 

gcaccggagc gggcactcct tccttcgcca tcccccgacc 

gcctccaccg gcgcagctca gggagcgggg gccggtctcc 

atg teg gat aac cag age tgg aac teg teg ggc 
Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly 
15 10 



ccctcctccc gctccctcct 300 

cttcaccccg gggactgggc 360 

tgctcggctg tcgcgcctcc 420 

teg gag gag gat ccg 468 
Ser Glu Glu Asp Pro 
15 



gag acg gag tec ggg ccg ccg gtg gag cgc tgc 
Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys 
2 0 2 5 



gga gtc etc aac aag 
Gly Val Leu Asn Lys 
3 0 



516 



tgg aca aac tat att cat ggg tgg cag gat cgc 
Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg 
35 40 



tgg gta gtt ttg aaa 
Trp Val Val Leu Lys 
45 



564 



aat aac act ctg agt tac tac aaa tct gaa gat 
Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp 
50 55 



gag aca gag tat ggc 
Glu Thr Glu Tyr Gly 
60 



612 



tgc aga gga tec ate tgt ctt age aag get gtc ate acg cct cat gat 
Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp 
65 70 75 80 



660 



ttt gat gaa tgc cga ttt gat att agt gta aat gat agt gtt tgg tat 
Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 



708 



ctt cgt get caa gat cca gat cac aga cag cag tgg ata gat gee att 
Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 110 



756 



gaa cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt 
Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 



804 



cga cat ggc tec atg gta tea ttg gta tec gga gca agt ggc tat tct 
Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 



852 



gca aca tec ace tec tea ttc aag aag ggc cac agt tta cgt gag aaa 
Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 



900 



ctg get gaa atg gaa acc ttt aga gat ata ctg tgt aga caa gtt gat 
Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 



948 



acc eta cag aag ttc ttt gat gec tgt get gat get gtc tec aag gat 
Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 



996 



gaa ttt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct 
Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 * 205 



1044 



acg aca cgt tct gat gga gac ttc ttg cat aat acc aat ggc aat aag 
Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 



1092 
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gaa aag gta ttt cca cat gta aca cca aaa gga att aat ggt ata gac 1140 
Glu Lvs Val Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 * 230 235 240 

ttt aaa ggt gag gcg ata act ttt aaa gca act act gec gga ate ctt 1188 
Phe Lys Gly Glu Ala He Thr^Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

get aca ctt tct cat tgt att gag ctg atg gta aaa cgt gag gac age 1236 
Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

egg caa aag aga atg gac aag gaa act gag aag aga aga aga gtg gag 1284 
Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

aaa gca tac aaa aat gec atg aca gaa ctt aag aaa aaa tec cac ttt 1332 
Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

gaa gga cca gat tat gag gaa ggc cca aac agt ttg att aat gaa gag 1380 
Civ Glv ?ro Asp Tvr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 ' 320 

gaa etc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata 1428 
Glu ?ne Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

gaa aaa rag teg cag agt gaa aag gtc agg tta cat tgg tct act tea 1476 
Giu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 



atg cca t:: gga gat gec ttt tct tct gtg ggg act cat aga ttt gtc 
Met Pre Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 



1524 



1668 



caa aag att gaa gag atg gtg cag aac cac atg ace tat tea ttg cag 1572 
Gin Lvs Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

gat gr.a ggt ggg gac gee aac tgg cag ttg gtt gta gaa gaa ggg gag 1620 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

atg aag gta tat aga aga gaa gta gaa gaa aat ggg att gtt ctg gat 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
405 410 415 

cct ttg aaa get ace cat gca gtt aaa ggc gtt aca gga cac gag gtc 1716 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

tgc aat tac ttc tgg aat gtt gat gtt cgc aat gat tgg gaa aca act 1764 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 



ata aaa .u: ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 
lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He lie 
450 455 460 



1812 
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att tat caa acg cac aag aga gtg tgg cca gcc tct cag egg gat gtc 1860 

lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

tta tat ctg tct gcc att cga aag ata cca get ttg aat gaa aat gac 1908 

Leu Tyr Leu Ser Ala lie Arg Lys lie Pro Ala Leu Asn Glu Asn Asp 
485 490 495 

ccg gag act tgg ata gtt tgt aat ttt tct gta gat cac age agt get 1956 

Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Ser Ser Ala 

500 505 510 

cct eta aac aat cga tgt gtc cgt gcc aaa ata aac gtt get atg att 2004 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

cgt cag acc ttg gtg age ccc cca gag gga aac cag gag att age agg 2 052 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

aac aac att eta tgc aag att aca tac gtg gcc aat gta aac cct gga 2100 

Asp Asn lie Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
3-iS 550 555 560 

gaa tag gcc cca gcc tea gtg tta egg gca gtg gca aag cga gaa tat 2148 

31y Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

rca aag ttt eta aag cgt ttt act tct tac gta caa gaa aaa act gca 2196 

Pre Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 

580 585 590 

gga aaa ccz att ttg ttc tagtattaac agtgactgaa gcaaggctgt 2244 
Gly Lys Pro He Leu Phe 
595 

gtgacattcc atgttggagg aaaaaaaaaa aaaaaaaaa 22 83 



: : : • s s e 

<ZIZ> PPT 

Bos taurus 

<4C0> 12 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Giu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
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85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie 
100 105 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 ^120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 , 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Val Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 

Phe Lys Glv Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
3 55 360 3 65 

Gin Lys Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
405 410 415 
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Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Leu Tyr Leu Ser Ala lie Arg Lys lie Pro Ala Leu Asn Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Ser Ser Ala 
500 505 " 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn Val Ala Met lie 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie Ser Arg 
530 535 540 

Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro lie Leu Phe 
595 



<210> 13 
<211> 78 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> (1> . . (78) 

<400> 13 

ccc tat agt cgc tct tec tec atg 
Pro Tyr Ser Arg Ser Ser Ser Met 
1 5 

tct gat gat gtt cac aga ttc age 
Ser Asp Asp Val His Arg Phe Ser 
20 



tct tec att gat eta gtc agt gee 48 
Ser Ser lie Asp Leu Val Ser Ala 
10 15 

tec cag 78 
Ser Gin 
25 



<210> 14 
<211> 26 
<212> PRT 

<213> Homo sapiens 
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<400> 14 

Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala 
15 10 15 

Ser Aso Asp Val His Arg Phe Ser Ser Gin 
20 > 25 



<210> 15 
<211> 2034 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBPR3 

<220> 

<221> CDS 

<222> (10) . . (990) 

<400> 15 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
1 5. 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 * . 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 



aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 



aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 * 100 105 HO 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gcc att gaa 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
115 120 125 

cag cac aaa act gaa tct gga tat gga tct gaa tec age ttg cgt cga 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 
His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 



195 



aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 



291 



339 



387 



435 



483 
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aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 

Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 

160 165 170 

get gaa atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg 579 

Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr 

175 180 185 190 

eta cag aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa 627 

Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 

195 200 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 675 

Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 

210 215 220 

acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 723 

Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 

Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly lie Asp Phe 

240 245 2*50 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 

Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 

255 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 867 

Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 

275 280 285 

cag aag aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa 915 

Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu 

290 295 300 

gca tat aaa aat gca atg aca gaa cga aaa aat ccc act ttg gag gac 963 

Ala Tyr Lys Asn Ala Met Thr Glu Arg Lys Asn Pro Thr Leu Glu Asp 
305 310 315 

cag att atg aag aag gee eta aca gtc tgattaatga agaagagttc 1010 

Gin He Met Lys Lys Ala Leu Thr Val 

320 325 



tttgatgctg 


ttgaagctgc 


tcttgacaga 


caagataaaa 


tagaagaaca 


gtcacagagt 


1070 


gaaaaggtga 


gattacattg 


gcctacatcc 


ttgccctctg 


gagatgeett 


ttcttctgtg 


1130 


gggacacata 


gatttgtcca 


aaagecctat 


agtegctett 


cctccatgtc 


ttccattgat 


1190 


ctagtcagtg 


cctctgatga 


tgttcacaga 


ttcagctccc 


aggttgaaga 


gatggtgcag 


1250 


aaccacatga 


cttactcatt 


acaggatgta 


ggcggagatg 


ccaattggca 


gttggttgta 


1310 


gaagaaggag 


aaatgaaggt 


atacagaaga 


gaagtagaag 


aaaatgggat 


tgttctggat 


1370 


cctttaaaag 


ctacccatgc 


agttaaaggc 


gtcacaggac 


atgaagtctg 


caattatttc 


1430 


tggaatgttg 


acgttcgcaa 


tgactgggaa 


acaactatag 


aaaactttca 


tgtggtggaa 


1490 
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acattagctg ataatgcaat catcatttat caaacacaca agagggtgtg gcctgcttct 1550 
cagcgagacg tattatatct ttctgtcatt cgaaagatac cagccttgac tgaaaatgac 1610 
cctgaaactt ggatagtttg taatttttct gtggatcatg acagtgctcc tctaaacaac 1670 
cgatgtgtcc gtgccaaaat aaatgttgct atgatttgtc aaaccttggt aagcccacca 173 0 
gagggaaacc aggaaattag cagggacaac attctatgca agattacata tgtagctaat 1790 
gtgaaccctg gaggatgggc accagcctca gtgttaaggg cagtggcaaa gcgagagtat 185 0 
cctaaatttc taaaacgttt tacttcttac gtccaagaaa aaactgcagg aaagcctatt 1910 
ttgttctagt attaacaggt actagaagat atgttttatc tttttttaac tttatttgac 1970 
taatatgact gtcaatacta aaatttagtt gttgaaagta tttactatgt tttttccgga 2030 
attc 2034 



<210> 16 
<211> 327 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBPR3 
<400> 16 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 12 0 12 5 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 * 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
145 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
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165 170 175 

Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 

Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 240 

Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly 
245 250 255 

Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu 
260 265 270 

Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 280 285 

Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr 
290 295 300 

Lys Asn Ala Met Thr Glu Arg Lys Asn Pro Thr Leu Glu Asp Gin lie 
305 310 315 ~ 320 

Met Lys Lys Ala Leu Thr Val 
325 



<210> 17 
<211> 1978 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG-GPBPDNLS 

<220> 

<221> CDS 

<222> (10) . . (1860) 

<400> 17 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
1 5 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 " 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 
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aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 24 3 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
80 85 90 

cat gaa tgc cga ttt gat att agt gta aat gat agt gtt tgg tac ctt 339 
Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 110 



cgt get cag gat cca gat cat aga cag caa tgg ata gat gee att gaa 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu 
115 12 0 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 



387 



435 



cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 483 
His Glv Ser Met Val Ser' Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 

aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 
Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 
160 165 170 

get gaa atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg 579 
Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr 
175 180 185 190 

eta cag aag tac tct gat gee tgt get gat get gtc tct aag gat gaa 627 
Leu Gin Lvs Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 
195 o 200 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 675 
Leu Gin Arg Asd Lvs Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 215 220 

acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 723 
Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 
Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe 
240 245 250 



aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 
Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 
255 ' 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 
Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 285 



819 



867 



cag aag aga ctg gat aag gaa act gag cac ttt gga gga cca gat tat 915 
Gin Lys Arg Leu Asp Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr 
290 295 300 
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gaa gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt gat get 963 
Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala 
305 310 315 

gtt gaa get get ctt gac aga caa gat aaa ata gaa gaa cag tea cag 1011 
Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin 
220 325 330 

agt gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat 1059 
Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp 
335 340 345 350 

gec ttt tct tct gtg- ggg aca cat aga ttt gtc caa aag ccc tat agt 1107 
Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser 
355 360 * 365 

cgc tct tec tec atg tct tec att gat eta gtc agt gee tct gat gat 1155 
Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp 
370 375 380 

gtt cac aga ttc age tec cag gtt gaa gag atg gtg cag aac cac atg 1203 
Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met 
385 390 395 

act tac tea tta cag gat gta ggc gga gat gee aat tgg cag ttg gtt 1251 
Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val 
400 405 410 

gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa gaa aat 1299 
Val Glu Giu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn 
415 420 425 430 

ggg att gtt ctg gat cct tta aaa get acc cat gca gtt aaa ggc gtc 1347 
Gly lie Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val 
435 440 445 

aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt cgc aat 1395 
Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn 
450 455 460 

gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa aca tta get 1443 
Asp Trp Glu Thr Thr lie Glu Asn Phe His Val Val Glu Thr Leu Ala 
465 470 475 

gat aat gca ate ate att tat caa aca cac aag agg gtg tgg cct get 1491 
Asp Asn Ala lie lie lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala 
48C 485 490 

tct cac; ~ga gac gta tta tat ctt tct gtc att cga aag ata cca gee 1539 
Ser Gin Arg Asp Val Leu Tyr Leu Ser Val lie Arg Lys lie Pro Ala 
495 500 505 ^ 510 

ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt tct gtg 1587 
Leu Thr Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val 
515 520 525 

gat cat cjac agt get cct eta aac aac cga tgt gtc cgt gee aaa ata 1635 
Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie 
530 535 540 

aat gtt art atg att tgt caa acc ttg gta age cca cca gag gga aac 1683 
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Asn Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn 
545 550 555 

cag gaa att age agg gac aac att eta tgc aag att aca tat gta get 1731 
Gin Glu lie Ser Arg Asp Asn lie Leu Cys Lys He Thr Tyr Val Ala 
560 565 570 

aat gtg aac cct gga gga tag N gca cca gee tea gtg tta agg gca gtg 177 9 
Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val 
575 580 585 590 

gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act tct tac gtc 1827 
Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val 
595 600 605 

caa gaa aaa act gca gga aag cct att ttg ttc tagtattaac aggtactaga 18 80 
Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 

agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt 1940 

agttgttgaa agtatttact atgttttttc eggaatte 1978 

<210> 18 
<211> 617 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG - GPBPDNLS 
<400> 18 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
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145 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
165 170 175 

Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 

Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 240 

Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly 
245 250 ' 255 

Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu 
260 265 270 

Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 280 285 

Arg Leu Asp Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu 
290 295 300 

Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu 
305 310 315 320 

Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu 
325 330 335 

Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe 
340 345 350 

Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser 
355 360 365 

Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His 
370 375 380 

Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr 
385 390 395 400 

Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu 
405 410 415 

Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie 
• 420 425 430 

Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly 
435 440 445 

His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp 
450 , 455 460 

Glu Thr Thr lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn 
465 470 475 480 
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Ala He He He 



Arg Asp Val Leu 
500 

Glu Asn Asp Pro 
515 

Asp Ser Ala Pro 
530 

Aia Men He Cys 
545 

He Ser Arg Asp 



Asn Pro Gly Gly 
580 

Arg Glu Tyr Pro 

Lys Thr Ala Gly 
61C 



Tyr Gin Thr Kis 
485 

Tyr Leu Ser Val 



Glu Thr Trp 'He 
520 

Leu Asn Asn Arg 
535 

Gin Thr Leu Val 
550 

Asn He Leu Cys 
565 

Trp Ala Pro Ala 



Lys Phe Leu Lys 
600 

Lys Pro He Leu 
615 



Lys Arg Val Trp 
490 

He Arg Lys He 
505 

Val Cys Asn Phe 



Cys Val Arg Ala 
540 

Ser Pro Pro Glu 
555 

Lys He Thr Tyr 
570 

Ser Val Leu Arg 
585 

Arg Phe Thr Ser 



Phe 



Pro Ala Ser Gin 
495 

Pro Ala Leu Thr 
510 

Ser Val Asp His 
525 

Lys He Asn Val 



Gly Asn Gin Glu 
560 

Val Ala Asn Val 
575 

Ala Val Ala Lys 
590 

Tyr Val Gin Glu 
605 



<210> 

<2i:> 19"S 

<2i2> o::a 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG-GPBPDSXY 

<220> 

<221> CZS 

<222> -10, . . a857) 



<400> 19 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 



teg aat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 
Thr Glu Ser Civ Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 
Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get eta agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 



99 



147 



195 



243 
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aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 
Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 110 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gee att gaa 337 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu 
115 120 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 13S 140 

cac ggc aaa gge cac agt tta cgt gag aag ttg get gaa atg gaa aca. 483 
His Gly Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr 
145 150 155 

ttt aga gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt 531 
Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe 
:S0 165 170 

gat gec tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa 579 
Asp Aia Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys 
I?* 180 185 190 

gtc gta gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt 627 
Val Yal Olu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly 
195 200 205 

gac ttc ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat 675 
Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His 
210 215 220 

ata aca cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata 723 
Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala He 
225 230 235 

act ttt aaa gca act act get gga ate ctt gca aca ctt tct cat tgt 771 
Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys 
240 245 250 

att gaa eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat 819 
He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp 
255 260 265 270 

aag gaa act gag aag aaa aga aga aca gag gaa gca tat aaa aat gca 867 
Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala 
275 280 285 

atg aca gaa ctt aag aaa aaa tec cac ttt gga gga cca gat tat gaa 915 
Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu 
290 295 300 

gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt gat get gtt 963 
Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val 
305 310 315 
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gaa get get ctt gac aga caa gat aaa ata gaa gaa cag tea cag agt 1011 
Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser 
320 325 330 

gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat gee 
Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala 
335 ~ 340 „ 345 350 

ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc tat agt cgc 
Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg 
355 360 365 

tct tec tec atg tct tec att gat eta gtc agt gec tct gat gat gtt 1155 
Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val 
370 375 380 



1059 



1107 



cac aga ttc age tec cag gtt gaa gag atg gtg cag aac cac atg act 
His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr - 
385 390 395 



1203 



cac tea tta cag gat gta ggc gga gat gee aat tgg cag ttg gtt gta 1251 
Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val 
400 405 410 

gaa gaa gga gaa acg aag gta tac aga aga gaa gta gaa gaa aat ggg 1299 
Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly 
415 420 425 430 

att gtt ctg gac cct tta aaa get acc cat gca gtt aaa ggc gtc aca 1347 
lie Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr 
435 440 445 

gga cat gaa gtc cgc aat tat ttc tgg aat gtt gac gtt cgc aat gac 
Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp 
450 455 460 

egg gaa aca act ata gaa aac ttt cat gtg gtg gaa aca tta get gat 
Trp Glu Thr Thr lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp 
465 470 475 

aat gca ate ate att tat caa aca cac aag agg gtg tgg cct get tct 
Asn Ala lie lie lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser 
480 485 490 

cag cga gac gta tta tat ctt tct gtc att cga aag ata cca gee ttg 
Gin Arg Asp Val Leu Tyr Leu Ser Val lie Arg Lys lie Pro Ala Leu 
495 " 500 505 510 

act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt tct gtg gat 
Thr Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp 
515 520 525 

cac gac age gec cct eta aac aac cga tgt gtc cgt gee aaa ata aat 
His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn 
530 535 540 

gtt get atg att tgt caa acc ttg gta age cca cca gag gga aac cag 
Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin 
545 * 550 555 

gaa act age agg gac aac att eta tgc aag att aca tat gta get aac 1731 



1395 



1443 



1491 



1539 



1587 



1635 



1683 
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Glu lie Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn 
560 565 570 

gtg aac cct gga gga tgg gca cca gcc tea gtg tta agg gca gtg gca 1779 
Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala 
575 580 585 590 

aag cga gag tat cct aaa ttt 'eta aaa cgt ttt act tct tac gtc caa 1827 
Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin 
595 600 605 

gaa aaa act gca gga aag cct att ttg ttc tagtattaac aggtactaga 1877 
Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
610 615 

agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt 1937 

agttgttgaa agtatttacf atgttttttc eggaatte 1975 

<210> 20 
<211> 616 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG-GPBPDSXY 
<400> 20 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
8 5 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 * 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
145 150 155 160 

Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
165 170 175 
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Cys Ala Asn Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
180 185 190 

Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
195 200 205 

Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
• 210 215 220 

Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala lie Thr Phe 
225 230 235 240 

Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu 
245 250 255 

Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
260 265 270 

Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala Met Thr 
275 280 285 

31u Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu Glu Gly 
290 295 300 

Pre Asn Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala 
lis 310 315 320 

A.a Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys 
325 330 335 

vai Arn Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser 
34.0 345 350 

Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser 
355 360 365 

3*r Me: Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Val His Arg 
2-C 375 380 

Lier Gin Val Glu Glu Met: Val Gin Asn His Met Thr Tyr Ser 

355 390 395 400 

Leu Gir. Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu 
405 410 415 

Glv Glu Met Lvs Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val 
420 425 430 

Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His 
435 440 445 

Giu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu 
450 * 455 460 

Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala 
465 ~ ~ 470 475 480 

lie He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg 
485 490 495 
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Asp Val Leu Tyr Leu Ser Val lie Arg Lys lie Pro Ala Leu Thr Glu 

500 505 510 

Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp 

515 520 525 

Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn Val Ala 

530 535 * 540 

Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie 

545 550 555 560 

Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn 

565 570 575 

Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg 

580 585 590 

Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys 

595 600 605 

Thr Ala Gly Lys Pro lie Leu Phe 

610 615 



<210> 21 
<211> 1915 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
FLAG-GPBPDSXY/NLS 

<220> 
<221> CDS 

<222> (10) . . (1797) 
<400> 21 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 
Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 9 9 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
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80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 33 9 

Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu 

95 100 105 110 



cgt get cag gat cca gat cataga cag caa tgg ata gat gec att gaa 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu 
115 120 125 



387 



cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca 4 83 
His Gly Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr 
145 150 155 

ttt aga gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt 531 
Phe Arg Aso lie Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe 
160 165 170 

gat gee tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa 579 
Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys 
175 180 185 190 

gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt 627 
Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly 
195 200 205 

gac ttc ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat 675 
Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His 
210 215 220 

gtg aca cca aaa gga att aat ggt ata gac ttt aaa ggc gaa gcg ata 
Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala lie 
225 230 235 

act ttt aaa gca act act get gga ate ctt gca aca ctt tct cat tgt 
Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys 
240 245 250 

att gaa eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat 
He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp 
255 260 265 270 

aag gaa act gag cac ttt gga gga cca gat tat gaa gaa ggc cct aac 
Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
275 280 235 

agt ctg att aat gaa gaa gag ttc ttt gat get gtt gaa get get ctt 
Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
290 295 300 

gac aga caa gat aaa ata gaa gaa cag tea cag agt gaa aag gtg aga 
Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
305 310 315 

tta cat tgg cct aca tec ttg ccc tct gga gat gee ttt tct tct gtg 1011 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser Ser Val 
320 325 330 



723 



771 



819 



867 



915 



963 
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ggg aca cat aga ttt gtc caa aag ccc tat agt cgc tct tec tec atg 1059 
Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Mec 
335 340 345 350 

tct tec att gat eta gtc agt gec tct gat gat gtt cac aga ttc age 1107 
Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His Arg Phe Ser 
355 360 365 

tec cag gtt gaa gag atg gtg cag aac cac atg act tac tea tta cag 115 5 
Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 * 380 

gat gta ggc gga gat gec aat tgg cag ttg get gta gaa gaa gga gaa 1203 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 

atg aag gta tac aga aga gaa gta gaa gaa aat ggg att gtt ctg gat 1251 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
400 405 410 

cct tta aaa gec acc cat gca gtt aaa ggc gtc aca gga cat gaa gtc 12 99 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
415 . 420 425 * 430 

tgc aat tat ttc tgg aat gtt gac gtt cgc aat gac tgg gaa aca act 134 7 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1395 
lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

att tat caa aca cac aag agg gtg tgg cct get tct cag cga gac gta 144 3 
lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 

tta tat ctt tct gtc att cga aag ata cca gec ttg act gaa aat gac 1491 
Leu Tyr Leu Ser Val lie Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
480 485 490 

cct gaa act tgg ata gtt tgt aat ttt tct gtg gat cat gac agt get 153 9 
Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
495 500 505 510 

cct eta aac aac cga tgt gtc cgt gec aaa ata aat gtt get atg att 1587 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

tgt caa acc ttg gta age cca cca gag gga aac cag gaa att age agg 1635 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

gac aac att eta tgc aag att aca tat gta get aat gtg aac cct gga 1683 
Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 ' 550 555 

gga tgg gca cca gec tea gtg tta agg gca gtg gca aag cga gag tat 1731 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
560 565 570 
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cct aaa ttr eta aaa cgt ttt act tct tac gtc caa gaa aaa act gca 1779 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
575 580 585 590 

gga aag cct att ttg ttc tagtattaac aggtactaga agatatgttt 1827 
Gly Lys Pro lie Leu Phe 
595 

tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt agttgttgaa 1887 
agtatttact atgttttttc eggaatte 1915 



<210> 22 
<211> 596 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
FLAG-GPBPDSXY/NLS 

<400> 22 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
1 5 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
145 150 155 160 

Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
165 170 175 

Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
180 185 190 

Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
195 200 205 
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Leu His Ser Thr 
210 

Pro Lys Gly lie 



Lye Ala Thr Thr 



Leu Met Val Lys 
260 

Thr Glu His Phe 
275 

lie Asn Glu Glu 
290 

Gin Asp Lys lie 
305 

Trp Prz> Thr Ser 



His A:n Pho Val 
340 

He Asp Leu Val 

Val Glu Glu Met 

3 7 0 

Gly Gly Asp Ala 
385 

Val Tyr Arg Arg 



Lys Ala Thr His 
420 

Ty r Phe Trp Asn 
435 

Asn Phe r.iz Val 
45C 

Glr. Thr K . c Lys 
465 

Leu Ser Val lie 



Thr Trp He Val 
500 

Asn Asr. Arg Cys 
515 



Asn Gly Asn Lys 
215 

Asn Gly lie Asp 
230 

Ala Gly lie' Leu 
245 

Arg Glu Asp Ser 



Gly Gly Pro Asp 
280 

Glu Phe Phe Asp 
295 

Glu Glu Gin Ser 
310 

Leu Pro Ser Gly 
325 

Gin Lys Pro Tyr 



Ser Ala Ser Asp 
360 

Val Gin Asn His 
375 

Asn Trp Gin Leu 
390 

Glu Val Glu Glu 
405 

Ala Val Lys Gly 



Val Asp Val Arg 
440 

Val Glu Thr Leu 
455 

Arg Val Trp Pro 

470 

Arg Lys lie Pro 
485 

Cys Asn Phe Ser 



Val Arg Ala Lys 
520 



Glu Lys Leu Phe 
220 

Phe Lys Gly Glu 
235 

Ala Thr Leu Ser 
250 

Trp Gin Lys Arg 
265 

Tyr Glu Glu Gly 



Ala Val Glu Ala 
300 

Gin Ser Glu Lys 
315 

Asp Ala Phe Ser 
330 

Ser Arg Ser Ser 
345 

Asp Val His Arg 



Met Thr Tyr Ser 
380 

Val Val Glu Glu 
395 

Asn Gly lie Val 
410 

Val Thr Gly His 
425 

Asn Asp Trp Glu 



Ala Asp Asn Ala 
460 

Ala Ser Gin Arg 
475 

Ala Leu Thr Glu 
490 

Val Asp His Asp 
505 

lie Asn Val Ala 



Pro His Val Thr 



Ala He Thr Phe 
240 

His Cys lie Glu 
255 

Leu Asp Lys Glu 
270 

Pro Asn Ser Leu 
285 

Ala Leu Asp Arg 



Val Arg Leu His 
320 

Ser Val Gly Thr 
335 

Ser Met Ser Ser 
350 

Phe Ser Ser Gin 
365 

Leu Gin Asp Val 



Gly Glu Met Lys 
400 

Leu Asp Pro Leu 
415 

Glu Val Cys Asn 
430 

Thr Thr lie Glu 
445 

He He lie Tyr 



Asp Val Leu Tyr 
4 80 

Asn Asp Pro Glu 
495 

Ser Ala Pro Leu 
510 

Met He Cys Gin 
525 
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Thr Leu Val Ser 
530 

lie Leu Cys Lys 
545 

Ala Pro Ala Ser 



Phe Leu Lys Arg 
580 

Pro lie Leu Phe 
595 



Pro Pro Glu Gly 
535 

lie Thr Tyr Val 
550 

Val Leu Arg Ala 
565 

Phe Thr Ser Tyr 



Asn Gin Glu lie 
540 

Ala Asn Val Asn 
555 

Val Ala Lys Arg 
570 

Val Gin Glu Lys 
585 



Ser Arg Asp Asn 



Pro Gly Gly Trp 
560 

Glu Tyr Pro Lys 
575 

Thr Ala Gly Lys 
590 



<210> 23 
<211> 2038 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBP-D1 69A 

<220> 
<221> CDS 

<222> (10) . . (1920) 
<400> 23 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 
Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 



teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 



99 



acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 33 9 
Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 110 



291 



cgt get cag cat cca gat cat aga cag caa tgg ata gat gcc att gaa 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
115 120 125 



387 
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cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 

Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 

13 0 13 5 14 0 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 483 

His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 

145 150 155 

aca tec ace tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 

Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 

160 165 170 

get gaa atg gaa aca ttt aga gee ate tta tgt aga caa gtt gac acg 579 

Ala Glu Met Glu Thr Phe Arg Ala lie Leu Cys Arg Gin Val Asp Thr 

175 180 185 190 



eta cag aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa 
Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 
195 200 205 



627 



ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 675 
Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 * 215 220 

acg cgt tct gat ggt gac ttc ttg cat agt- ace aac ggc aat aaa gaa 723 
Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 
Lys Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe 
240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 
Lys Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu Ala 
255 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 867 
Thr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 285 

cag aag aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa 915 
Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu 
290 295 300 

gca tat aaa aat gca atg aca gaa ctt aag aaa aaa tec cac ttt gga 963 
Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly 
305 310 315 

gga cca gat tat gaa gaa ggc cct aac agt ctg att aat gaa gaa gag 1011 
Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu 
320 325 330 

ttc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata gaa 1059 
Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu 
335 340 345 350 

gaa cag tea cag agt gaa aag gtg aga tta cat tgg cct aca tec ttg 1107 
Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu 
355 360 365 

ccc tct gga gat gee ttt tct tct gtg ggg aca cat aga ttt gtc caa 1155 
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Pro ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin 
370 375 380 

aag ccc cat age cgc tct tec tec atg tct tec att gat eta gtc agt 
Lvs Pro Tvr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser 
385 390 395 



cag aac cac atg act tac tea tta cag gat gta ggc gga gat gec aat 
Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn 
415 420 425 430 



1203 



gec tct gat gat gtt cac aga ttc age tec cag gtt gaa gag atg gtg 1251 
Aia Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val 
400 405 410 



1299 



tag cag ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa 1347 
Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu 
435 440 445 

gta aaa gaa aat ggg att gtt ctg gat cct tta aaa get acc cat gca 1395 
Val Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala 
450 4S5 460 

gtt aaa ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aac gtt 1443 
Val Lvs Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val 
465 470 475 



gac gtt cgc aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg 
Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val 
4 80 4 85 4 90 

gaa aca tta get gat aat gca ate ate att tat caa aca cac aag agg 
Glu Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg 
495 500 505 510 

gtg tgg c?t get tct cag cga gac gta tta tat ctt tct gtc att cga 
Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg 
515 520 525 

aag aca r:a gec ttg act gaa aat gac cct gaa act tgg ata gtt tgt 
Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys 
530 535 540 

aat ttt tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc 
Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val 
545 550 555 

cgc q:: aaa ata aat gtt get atg att tgt caa acc ttg gta age cca 
Arg Ala Lvs He Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro 
560 565 570 

cca gag gga aac cag gaa att age agg gac aac att eta tgc aag att 
Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He 
575 580 585 590 

aca tat gta get aat gtg aac cct gga gga tgg gca cca gec tea gtg 
Thr Tvr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val 
595 600 605 

tta aag aca gtg gca aag cga gag tat cct aaa ttt eta aaa cgt ttt 
Leu Arg A^a Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe 



1491 



1539 



1587 



1635 



1683 



1731 



1779 



1827 



1875 
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610 615 620 

act tct tac gtc caa gaa aaa act gca gga aag cct att ttg ttc 1920 
Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
625 630 635 

tagtattaac aggtactaga agatatgttt tatctttttt taactttatt tgactaatat 1980 

gactgtcaat actaaaattc agttgttgaa agtatttact atgttttttc cggaattc 2038 



<210> 24 

<211> 637 

<212> PRT 

<213> Artificial 



Sequence 



<220> 

<222> Description of Artificial Sequence: GPBP-D169A 
<400> 24 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
1 5 10 15 

Asr. Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu ASp Pro Glu Thr Glu 
20 25 30 

itrr Giy Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
SO 55 60 

Leu 5er Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
6S 70 75 80 

Ser He Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
145 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
165 170 175 

Met Glu Thr Phe Arg Ala lie Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 
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Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 240 

Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly 
245 250 255 

Glu Ala lie Thr Phe Lys Ala' Thr Thr Ala Gly He Leu Ala Thr Leu 
260 265 270 

Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 230 285 

Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr 
290 295 300 

Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro 
305 310 315 320 

Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe 
325 330 335 

Asp Ala Val Glu Ala Ala 'Leu Asp Arg Gin Asp Lys He Glu Glu Gin 
340 345 350 

Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser 
355 360 365 

Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro 
370 375 380 

Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser 
365 390 395 400 

Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn 
405 410 415 

His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin 
420 425 430 

Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu 
435 440 445 

Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys 
450 455 460 

Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val 
465 470 475 480 

Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr 
485 490 495 

Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val Trp 
500 505 510 

Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He 
515 520 525 

Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe 
530 535 540 
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Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala 
545 550 555 ~ 560 

Lys lie Asn Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu 
565 570 575 

Gly Asn Gin Glu lie Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr 
580 * 585 590 

Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg 
595 600 605 

Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser 
610 615 620 

Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
625 630 635 

<210> 25 

<211> 12482 

<212> DNA 

<213> Homo sapiens 

<400> 25 

tcgatcattt ccctcttcat attcagtgta tattgcacag atctctcaac aacacagcca 60 
ttaaatagat attctccaag tgacacttac atcacacatg tttgagttta cgttacttgc 120 
aaacacaggg aaagaaagat acatgggata aactggtgca tgagaaatga gaccttagca 180 
gttggctgaa ataaatgaga acaactgagg caaactaaag aggaagaagg gcaagtggca 240 
gctcaacagg agtaagatga tgagatgaag ggcagaatac cttcatggag aggaggcaaa 3 00 
gagatataca tgatatgttc ttaggaacat aactgaagca aacaatgata ttatttctaa 360 
ttacatataa acccgtgagt cagcctccca ggggcggcct gctaaggtag aatcattgga 420 
atgatttggc cagggtttgg ataggagaga attggcagca gcgttaagat tgacccacga 480 
caaaLaatgc tatgcaggta gcagggagtc tgactaggag caaaatcaac gaacttatcc 54 0 
cttgcctaac atagcatctg tggagtcaga aagaagaggt taaattggga tatctgaggc 600 
aagcaccagg atttgccatg tctgcggagt agtttcataa ttctaatggt tataagcact 660 
aaggcgttca ctaagcgaac gttggtagtt ccaggttata ttatccattc ttgagttaca 720 
aaatacactt taaaaccttc ccatcttaat attatatgtt tttttagtca cagagtgaaa 780 
aggtgagatt acattggcct acatccttgc cctctggaga tgccttttct tctgcgggga 840 
cacacagact tgtccaaaag gtaagctaat gtcagagctt actaaaagta caccttgtac 900 
tgttcttcat tgttggtgga aatatctttt atttgagacg gagtctcact ctgtcaccag 960 
agtggagtgc agtggcgcga tctcggctca ctacagtctc cacctcccgg gttcaagaga 102 0 
ttctcgtgcc tcagcctccc cggtagctgg gattacaggc atgtaccacc acacccagct 1080 
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aatttttgta tttttaatgg agacagtttc accatggcca ggatggtctt gatctcctga 114 0 
ccttgtgatc cacccacctc agcctcccag agtgctggga ttacaggcgt gagccaccat 1200 
gcccagccgg aaatatcttg tagtatataa gttttctccc cttttcatta atttaagtaa 1260 
tgagactgtt: tttggtttta tatattgtat tccatataca tcctccaaaa cagttagaaa 1320 
ttttgttctg aaaataaagt tctttcattt ttatttaagg ggaaagttgg gggtgggcaa 13 3 0 
ataaggagtg gctagtccaa aatagttaac cagaagtata tccagttata ctaaatctct 144 0 
crcttctttg gggttaaatg gtattacttt gtattattgg aagcactaca ttcttzttttg 1500 
gaatgatttt ggaacataat acataatagg tgcatgaagt cagcagttgc tgctgtgctt 1560 
gtttcatata gtgctttgtt ttctcttccc tttatcttgt gtttggaagt tggtactgaa 1620 
cgctccgctg tgcctttgtt ctgattactt ggttttttct ttgtctgtct ctggtagccc 1680 
ratagccgct cttcctccat gtcttccatt gatctagtca gtgcctctga tgatgttcac 1740 
agacrcagct cccaggtact gtatgaatgt atagagtgga cttgagtctt. tctgtgctat. 1800 
atttcagccc gctttcccag ttcctagaaa tcttttggtt aggccactga ttttagtttt 1860 
qaattttaaa tagtaacatt aagcattaaa aaggtcttcc ttgtctacta aatagttcct 1920 
ctqccaggrc tgcatgtgtc ctttactatt cacagcttgg aattttgtca tataggaggt 1980 
actccaqaaa gattttcaaa ctgaattgaa acaaatagaa gatactgggt tttgtatatc 2040 
atgtaatatc tgtttcttca gtcaggattt agcagttttg atggacgtgg tccatatgat 2100 
atgttatagc agaaaagcag atttttacaa gtctcacttt aaagcctaaa gtacccccaa 2160 
ttaatat:ca acaaggaaat cactttttaa taatatgttt catttccatt ataatactaa 2220 
acrc:attga gcagatrgtg ttttccttat gcaaattacc tttggatatt ataaatgaat 2280 
a:::ctc::c atatgctaaa tctatggaaa tttgttttaa tttttagcat tggtaagggn 2340 
ctaggaattc aagacaggaa gctggatgct tgcggtctct aaagtctgta ccctcaaaat 2400 
aaaaccagat taccattgga agaagttttt tttagtgtca gcgttagttc tttttttaat 2460 
:ctct:aacc ttcacatctt tgccattcaa ctttttatct ttctggtgat tgcattttat 2520 
tggacnagat tatattatgt taatcttata ttaaagacct gagcactctg gccagaatga 2580 
ctcagttcaa accctggtta ggtgtatgat cccagtaagt tttctaactt ttttgtgctt 2640 
catttttaLg atttagctag aacctgacac ataataagtg ctcaataaat gttaccttgt 2700 
actgctacta taacataatt tctttgagct aataaaagtt atctacatca ttattttttc 2760 
ctctgtgaga gtattgctat aaaagttttt aaaagtcata gtttaaagag atttctatta 2820 
ttcttatgtt tataaacaaa gtttacatta gtttttaacc tgcaatagag aagaatatta 2880 
agactttaat ctttctgact tgtacagcgt ttttctcctt gaatactctt aagaaaaaga 2940 
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tttagcaatt 


ctggatcaga 


aatcatccat 


aaccaaatat 


accacagtat 


attttacctt 


3000 


ttgctcgtcc 


atttatgcat 


ttttttttaa 


ttttacttat 


ttattttcga 


gacagggtct 


3060 


tgctctgttg 


cccaggctgg 


agtgcagtgg 


cacgatctgg 


gctcactgca 


acctccatct 


3120 


cccaggttca 


agcaattccc 


ctgcctcagc 


ctcccaagta 


gctgggatta 


caggcacgca 


3180 


ccactatgcc 


cagctaattt 


ttgtattctt 


agtaaagacg 


ggttttcacc 


atgttggcca 


3240 


ggctggtcta 


gcactcctga 


cctcgtgatc 


tgcccacctc 


ggcctcccaa 


agtgctggga 


3300 


ttacaggtgt 


gagccaccat 


gcccggccct 


gcgtatgttt 


ttaaaaagag 


actcatattc 


3360 


ataatgaatc 


tgtgacaaaa 


ctacataata 


ctgggagact 


ttggtttatt 


gtgctaagct 


3420 


ccacattgca 


ttaaaatcat 


atcacagact 


aatcaaaaat 


gcaggaatac 


ataggctata 


3480 


aatgaaagaa 


aatataatga 


cagcaaagaa 


agaatgtaag 


ccagtaataa 


agaatgccta 


3540 


agaattaggg 


gttcagaacc 


caaaccaggg 


ccctcactgt 


agtgctgtag 


aacagctgaa 


3600 


ttgcttt.taa 


gtccaggtaa 


ctatatcact 


gagaagcagg 


tgcctatatt 


tttacaaaat 


3660 


tttgctgaca 


gcttacttct 


tcgtaatatt 


aatacccttt 


tgtaaaactc 


atgtatgtaa 


3720 


cttgagagaa 


atcttgctgg 


atttttttct 


ctaatatacg 


gtgctcatga 


ttgatcagat 


3780 


cctgttttag 


cctttgatta 


tgcactgttt 


tatatgccag 


aagaggtaaa 


aatgaagaaa 


3840 


ataacattaa 


ggtcttcaag 


tatttgttgt 


ccttgctaaa 


gcattagttg 


tcattagcag 


3900 


acgtggactc 


tagcaattca 


ctgttgtaat 


taaattgtgt 


gccttatgtt 


cagcagtt.cc 


3960 


tttataatag 


atgactaatt 


cccaattgat 


aagatttttt 


gtttcagagg 


atgttacact 


4020 


gccttatcag 


ccattaccaa 


aggatctagc 


aagttgattc 


tgtatagtca 


cacttgagaa 


4080 


tatagcattg 


gatgtagatc 


tggagttaat 


attagttgag 


aaacatcgtg 


ttatctggaa 


4140 


aactcttcca 


gttcaacaca 


gtgtaaaatt 


atagtagtga 


ctatacagta 


gtgttacatt 


4200 


ttacagttct 


cacaccctat 


agagactttt 


gtattaacaa 


aataagaggc 


tcaaaggtta 


4260 


ttcattaaca 


ttagaaacac 


ttatgttata 


ttacattgca 


tcggtctttt 


ctgttttttg 


4320 


tttttttttt 


ttttttgaga 


cggagtttcg 


cttttgttgc 


ccaggctgga 


gtgcaatggt 


4380 


acgatcttgg 


ctcactgcac 


cctctgcccc 


ctggattcaa 


gcgattctct 


tgcctcagcc 


4440 


acctgagtag 


ctgggattac 


aggcacctgc 


caccacaccc 


agctaatttt 


ttttcatttt 


4500 


tagtagagat 


ggggtttcac 


catgttggcc 


aggctggtct 


cgaactcctg 


acctcaggtg 


4560 


atctgcccgc 


ctcggcctcc 


caaagtgctg 


ggattacagg 


catgagccgc 


cacacctggc 


4620 


ctacatcgtt 


ctcaatacac 


aaatatacat 


cagttactcc 


acagcgcttg 


atatgggagg 


4680 


taaccaaatt 


ctttgtttta 


taatatcttc 


ataattaatt 


aaaaaactaa 


gtcgacattt 


4740 



57 



WO 00/50607 PCT/IB00/00324 

ttaatcacct ttaataattt gccaaaatat tatataagca taatataatc aattcttact 4 800 
tactccaaca aattttaaaa gtccagatzac agataccata tctagtttct tgatcattta 4860 
tatcagctcc catacagaag ccttctaaat ctctggtaat ttcactttgc tgtttatata 4 920 
agtgttggct catgactacc ttgttcttct tgaaatgatg ttttatagcc ttgaattggc 4980 
tgaaataatc aagtgtacaa ttgagagatg ccctgaaaac agcttaaaat aaaatatgca 5040 
catctactag gaaattagta ccaacacatg aatctgtctg atgggcagat attaggaatg 5100 
aagtcactcc agatctgaga aattaaagtt gtaaaggact gcaagttctg tgtttttgtt 5160 
gttgttgttg ttgttgttgt tgtttgtttt ttcatttttg ttttttgggt ttttttgaga 5220 
cagagtctca ttctgtcacc ;aggctgtag tgcagtggca cgatctcaac tcactgcaac 52 80 
ctccgtctcc caggttcaag egattctcct gtctcagctg ggattacagg cacacgctat 5 340 
cacacccagc taatttttgt atttttagta gagacagggt ttcaccatgt tagccaggct 5400 
ggtctcgaac tcctgacctc aagtgatctg cccgtctcgg cctcccaaag tgctgggatt 5460 
acaggcctga gacaccatgc ccagcatttt tttttttttt tttttttttt gtaaagagac 5520 
aaggtttcac ttgtccaggc caagtgcagt ggcatgatca tagctctgta acctgacctc 5580 
tgacctccga cttcctggac acaagtgatc ctcctgtctc tcagcctccc aagtagctgg 5640 
gactacaggc attccaccac acccaactaa ttgtttttat tttttgtaga gacagggcct 5700 
tgctatgttg cccaggctgg caagttcttg aaataatggc tgtggccaca aactagaaaa 5760 
taattttcag gtgtacagag aatagaaaga atttagattc ataaattgat cattttgttc 5820 
acagttattt gcataacaca gttcacattt aaaggtgtca ccttagaaat caaaggggaa 5880 
gaacatcatc ctctattgaa aaagaaagaa atcaaaggat gtacagtgaa tttgcagctt 5 94 0 
aatctatggg gagcatcatt gcaaaaaacg gttctgtgtg aggctctttc ccaccctttg 6000 
tccacaggag cacattattg ttgtagtaat tatttcaccc ctctcccttt ttcagtgtac 6060 
aagtgataca tgctaatttt aacagaactt gaaagtagaa taaaattaaa ataatagttt 612 0 
actaatattc catctatctt ctctcatata tatgagataa atattaaggt gtatgtactt 6180 
atccatatgt gcctgatttt tiaaaatcct tgtatatgca tctttgcacc cttatctaat 6240 
tatttcctta gaatatattc ctagaagcat aattgtggga acaaaggcca tgaacatttt 6300 
caagtgttta ttttattatt ttattttatt tttattaatt ttgatacagg grtttgcttt 6360 
gttccccaga ctggagtgca gtggtgagat caccactcac tgcaccttga cctcctggac 6420 
tcaagcgatc cacctgcctc agtctcctca gtagcggggg ctaaggacta caggcacatg 6480 
ccatcatgcc cagctaattt ttttatttgt agcagagacg aggtctcact gtgttgccca 6540 
ggctgctatt ttatttattt tttaagagat agggtctcat tctgtcttcc aggctagaat 6600 
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gcagtggcac aatcatagct cactgcaacc 
tgggactaca ggcatgggcc accactctca 
gtagatatgg gggtctcact gtgttgccta 
tcttcccacc tcagcctccc aaagtgctag 
taaaaatttt tactgccaaa ctcttcatta 
cagttttcta ttgatatagt agcactgaat 
gtaggctaaa agtgctatgt tcttagccat 
gttattaata attattctat ctaacaagcc 
catagttgta tttccttttt tgaggtggag 
cggcgcgatc tcggctcact gcaccctccg 
agactcctga gtatctggga ctacaggcat 
ttttagtaga gaggggagtt caccgtgtta 
ggnccgcgtg ccccagcctc ccaaaatgct 
ccaacattcc ttttacatgc ataaaagaga 
tctttttttt tttttttttt tttttttttt 
tcttgctatg ttgccgagac ttaacctcaa 
gctgggacta caggcatgaa ccaccatgcc 
gcaattttct tttcaaattt ctgggtaata 
agaaacggca tcacactgag tgattcaaat 
tgaacgtccc ctcttatctc aaatgtactt 
ttgaagatgc tttgaatgtc acctaatcat 
catgatttct gaaattacac ttagcctaat 
aatgtttttg actgagtctt tttcattcat 
ttcctagaat gtcaaatttt gagcctaata 
tttgtttttg tttttttttt aacgaaactt 
tttttttttt tttttttttt tgagacggag 
tggcgtgatc ttggctcact gccacctccg 
agcctcctga gtagctggga ctacaggcac 
tttttagtag agacggggtt ttaccatgtt 
cgatctgccc gcctcagcct cccaaagtgc 



tcaagcgatc tttgcctcag cctgagtagc 6660 
gctaattttt ttttcaattt tttatttttt 6720 
ggctggtctt gaacccctag cctaaagtga 6780 
gattacaggc cacaggcctc agccaagttt 6840 
gaaaagttga accagcctac attcccaggc 6900 
attataattc agttaacttt tgtcaatacg 6960 
ctctcttttg ggttaacagt gcactatttt 7020 
ccctctatgg ttttgtggct ttgtagtaag 7080 
tcttgctatg ttgcccaggc tggagtgcag 7140 
cctcccgggt tcaagtgatt ctcctgcctc 7200 
gcaccaccac gcccagctaa ttttttatat 7260 
gccgggatgg tctctatctc ttgacctcgt 7320 
gtgattacag gcatgagcca ccctgcctgg 7380 
tctgagctgt ttttgagccc ttctagactt 7440 
tttttttttt ttttttttaa gtagatgagg 7500 
actcctaggc ccaagcaatc ctcccaagct 7560 
caacttagac ttttattgta ctatcaaaag 7620 
gtgttagaaa aatcctactt ggtaacatcc 7680 
gtgagatgga agaaaaggtt agaattggag 774 0 
tacctccatt ttgtttcata gtttattagt 7800 
tttcaactct aggtccagaa aaatcaaggg 7860 
taaaacttag aaacactgtt caccttcttc 7920 
aagtgacagg aggtgttact ataacattat 7980 
gcatggtaaa tttggctata tttgttgttt 8040 
agtatttcct tgtttcccac ttcttttttt 8100 
tctctctctg tcatccaggc tggagtgcaa 8160 
cctcgcaggt tcacgctatt ctcctttcac 8220 
ccaccaccac gcccggccaa tttttttgta 8230 
aggcaggatg gtctcgaact cctgaccttg 8340 
tgggattaca ggcgtgagcc accgcacctg 8400 
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gcctcccact tctttttaat atgtcgtgtc ataactgaac agtaaagtga gcagattatc 8460 
aggttaaatc tgaagtgtca gtctggtcac cagtgcccaa gttactgccc ctatggtaat 8520 
attggttact ttgtattttc ctacagcaaa cataaaattt gttatagtga gatttttacc 8580 
tgtatacccc tcttaacttt aatgtjtatta cctcaaggaa gatattatca tgaargaaga 8640 
ttccatgatg aaagttttgc agagtttatt gcagtaattt agcacttcat tagaatcttt 8700 
agttttttag gagcacagta ctgaatgttt gtttctttgt tggacctttt gaaaaccggt 8760 
tttccattga tgcagtgtag ctgttacagg aatatcattt ttaaaacgtt tttatacagc 8820 
atggctgaaa attgaacctg ggcctccctc gtggcctacc attgaaggaa cagcattttt 8 880 
tgcctatcta gaaagacaat gttaaatgtg ctatctatat atru.:taac ttgtgctacc 8940 
tactacgcgt ttatatttgt ggaatctgtt ttcttttgga caaaaccaca aatcaaaaac 9000 
acctcatttc ttaggcattt gaaatcccta attcagaata atctcccaaa cagaaacaca 9060 
actacctgca ttctttttga caaaagagct aagtagcatt agaaaattat tttaaaccca 9120 
attctgtttt ttaacagaat aaaattcttc tgttcttcac attcttcttt cataggtaac 9180 
ctattgaaag tagggtttat ttgggggaag catttctttc tgtctcttat ctcataataa 9240 
atacaggtgt gcttaactac tagtttccta cctcaaagat atactcaaat ctaaagatgt 9300 
ttaagatttt gggatctgaa gagtaaacat ttctcctaat cacaatgtga cagagacaaa 9360 
tgaatcaagc caatgctact tttatttatg catactaact ggaacttttc tttttggaaa 9420 
tcagatacat tttgtatgta ttagtaattt ggaatcctgc attggttatc ctcgccctcc 94 8 0 
caaagcagat tctgaaatca taaaggtgca caggttctcc atgcaacacc aaaagttata 9540 
ttttccaagg ctttgtaaaa ttgtagaatg tcctgttaaa tttctgtcaa atcagtaact 9600 
cacactgttt tgagaattat gaataaagga ataaaatatt gttagtgttt atttagtaca 9660 
aaagtagatt atagaatctc agcatttttg tcaaaaaatt tctttttgat gattgacaga 9720 
tcaggagaca cttaaggcca tacctgcttt cagtaatcaa aaatgcattt aagatccaga 97 80 
aacttgaggt agcagaacat cactatcaca tataacatat cctttggtat agaaaattat 9840 
attcccagag tgagtttctt ttttaaaacc attaatgagg ccaaggtggg aagatcactt 9900 
gggaccagga gttcaagacc aagcctgggc cagatggcga gaccctgtct ctacaaaaaa 9960 
ttaactggat gtggtggtgc actcctgtag tcccacctac tcagaggctg aggcaggagg 10020 
atcccttgag cccaggaaat tgtagtggca gtgagctatg atcatactac tgtactgcag 10080 
tctgggccac gaagtgagac cgtgtctctt aaaaaaaaaa aaatgttagg catggtggca 10140 
caggcatata gttttagcta cttaggaggc tgaggcagga ggatcacttg agcccagaag 10200 
ttcaagatta cagtgagtta tgattgtgcc gctgcactcc aacctgggtg acaaaataac 10260 
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cctgtctctg gcgggtaggg gggaagttga ttatttactt tgaaatatgt tcaaaactga 10320 
ttcctgttct atattcctaa tgaacagaat agactttata taaaacaaat agttaaactt 10380 
aaggataaaa ttttaatgga agtataatac atatatcttc cagctcttct gtcttctaat 10440 
gtatttatta cagaaaatga aattactttg tttccgcaan ctttgtatca cttcagttct 10500 
ccaataaatc tgagaattct ggtagtgtga aatattcagc tttctttgct tatttacata 10560 
aaatgtataa ggacaatttg tgataattaa gagttacatt taaatatcag gaaaaagtta 1062 0 
taaatttaaa ttaaaaaatt ttaaaaggaa attattagaa attttaaaag aacgaactaa 10680 
aaggtgatta tatgtaaatg cttgcatata tgaatattag cattgtcccc aaaataattt 10740 
agaacaaaga aattggaatc aaataaataa aggcttgatc atttttaaat tggcttatat 10800 
tccatgataa aagagaggtt tatcagtggc ataagaaagg tttttcacct tttttgtatt 10860 
gaaatctttg acatatacat atatatcttt gctcatcntt gtgtatcttt gctcgtatga 10920 
gagcaaagat ataggcaaag atatgctctc tctctctatg cctttgttca taccaagacc 10980 
ttcctgatat ctccacataa tcttaaatat aggaacatta gactggatga tctctgtgcc 11040 
ccctttatct ctactcttcc attattttat actttaacac atcatctctg ttttatgata 11100 
taagaatgga atatttcttt tttcctgaaa atgcttatitt tggtcacttg atacacatta 11160 
ggccaatacg tgttacttga gtgacccatc ttccttcttt tcatttctgt ctcctgtcat 11220 
taacctggat atctggaatg tggactaaac tcttcaaaca ctatgtaaaa cctactaacc 11280 
tttgtgcatt tggttgctca gctactaaga gcaccatttc tgaactgaag ttaaccgaag 11340 
accattctgt tttagagatt atgacatacc ttttggattc tcatgccttt ttcctccctt 11400 
ctcaaggttg aagagatggt gcagaaccac atgacttact cattacagga tgtaggcgga 11460 
gatgccaatt ggcagttggt tgtagaagaa ggagaaatga aggtaattcc ccctgaaatg 11520 
ttatagattg ccaaaggcgt ctctgtttca gtcatattat cattactatt gatatgaata 11580 
aggatagcac tttcaactta cctttaaaac aaattattac atgrgatcaa agcagtacca 11640 
tatattgagc aataaaatgt ctttttgctt ttctggcttt gcctttacta aaggttctta 11700 
tgattacaat ataaatatat gactaaacct ttctgttttg actaggccat gaagaaaata 11760 
aaatttagag aattagatat gaccaggtca caattagctg atggtcctgt atttggatat 11820 
ttccttttgt tttgtttttt taacatactg aatgttgtgc ctagatgaca ctttgtttct 11880 
ctcccttttt ggtccatacc ctccttcttt tcccttctct tactgcacct ttaattgata 11940 
tttggacatt ggtcagttaa ccctggttac atccctaaac acatggacag aaaataagag 12000 
cagggactga gagatacaga gatggattga aaagcaaaag caacattgaa ttttggattt 12060 



61 



WO 00/50607 PCT/IB00/00324 

tctcattcct aaggaactat gctaaataaa gatacaaaga taataagaca ctctccaagc 12120 
taaagcttca gttaaggaaa aagaatattg acatttaaaa gatactattg gccaggcaca 12180 
gtggctatgc ctgtaatccc agcactttta ggaggacatg gcaggcggat tacttgagct 12240' 
caggagttca agtcaaacct gggcaacacg gtgaaacccc gtctctacca aaaatacaaa 12300 
aattagctgg gtgcagtacc acacacttgt agtcccagct acccaggagg ctgggcaaaa 12360 
gattccttga gccagggagg tcaaggctgc aatgagccgc gtttgtgcca ctgcactcta 12420 
gcctgggtca caaagtgaga ccctgtgtga gatatatata tatatatata tatatatata 12480 
ta 12482 



<210> 26 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPpepl 
<400> 26 

Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr Thr Arg 
15 10 15 

Gly Phe Val Phe Thr 
20 



<210> 27 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPpeplAla9 
<400> 27 

Lys Gly Lys Arg Gly Asp Ala Gly Ser Pro Ala Thr Trp Thr Thr Arg 
1 5 10 15 

Gly Phe Val Phe Thr 
20 



<210> 28 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-54m 



<400> 28 

tcgaattcac catggcccca ctagccgact acaaggacga cgatgacaag 



50 
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<210> 29 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-55c 
<400> 29 

ccgagcccga cgagttccag ctctgattat ccgacatctt gtcatcgtcg 50 



<210> 30 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-HNC-B-N- 14m 
<400> 30 

cgggatccgc tagctaagcc aggcaaggat gg 32 



<210> 31 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-HNC-B -N- 1 6c 
<400> 31 

cgggatccat gcataaatag cagttctgct gt 32 



<210> 32 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG peptide 
<400> 32 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 33 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
peptide 

<400> 33 

Pro Arg Ser Ala Arg Cys Gin Ala Arg Arg Arg Arg Gly Gly Arg Thr 
1 5 10 15 
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Ser Ser 



34 
DNA 

Artificial Sequence 
O20> 

<223> Description of Artificial Sequence: ON~GPBP-llm 
<400> 34 

gcgggactca gcggccggat tttct 25 



<210> 
<211> 
^212 > 
<23,3> 



<210> 35 
c211> 18 
<212> DNA 

<213> Artificial Sequence 
<22 0 > 

<223 > Description of Artificial Sequence: ON-GPBP-15m 
<400> 35 

acagctggca gaagagac 18 



<210> 36 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-20C 
<400> 3€ 

catgggtagc ttttaaag 18 



<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP - 22m 
<400> 3" 

tagaagaaca gtcacagagt gaaaagg 



<210> 38 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

< 2 2 3 Description of Artificial Sequence: ON-GPBP- 53c 
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<400> 38 

gaattcgaac aaaataggct ttc 



<210> 39 

<211> 17 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: ON-GPBP-56m 



<400> 39 

ccctatagtc gctcttc 



<210> 40 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
v22C> 

-223 > Description of Artificial Sequence: ON-GPBP-57c 

4CG> 4 0 
ctgggagctg aatctgt 



-213 > Artificial Sequence 

-223* Description of Artificial Sequence: ON-GPBP-62c 
-400. 4; 

gtggttctgc accatctctt caac 



-2 10 > 42 
•■ — «. » ■» * 
v212- DNA 

-213- Artificial Sequence 
<22Q> 

<223> Description of Artificial Sequence: ON-GPBP-26 
<400> 42 

cacatagatt tgtccaaaag gttgaagaga tggtgcagaa c 



<210> 43 
<211> 19 
<212> PRT 

<213> Artificial Sequence 



<220> 

. <223> Description of Artificial Sequence: GPIII derived 
peptide 
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<400> 43 

Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu Phe Val Lys Val Leu 
15 10 15 

Arg Ser Pro 



<210> 44 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPIII-IV-V 
derived peptide 

<400> 44 

Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu Phe His Gin 
1 ~ 5 10 



<210> 45 
<211> 685 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDV 

<220> 

<221> CDS 

<222> (1) . . (633) 

<400> 4 "3 

ggt tt~ aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 4 8 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 

1 5 10 15 



acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 



96 



tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 



ttt gta caa gga aat caa cga gec cac gga caa gac ctt gga act ctt 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Gly Thr Leu 
50 55 60 

ggc age tgc ctg cag cga ttt acc aca atg cca ttc tta ttc tgc aat 
Gly Ser Cys Leu Gin Arg Phe Thr Thr Met Pro Phe Leu Phe Cys Asn 
65 70 75 80 

gtc aat gat gta tgt aat ttt gca tct cga aat gat tat tea tac tgg 
Val Asn Asp Val Cys Asn Phe Ala Ser Arg Asn Asp Tyr Ser Tyr Trp 
85 90 95 

ctg tea aca cca get ctg atg cca atg aac atg get ccc att act ggc 
Leu Ser Thr Pro Ala Leu Met Pro Met Asn Met Ala Pro He Thr Gly 



192 



240 



288 



336 
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100 105 110 

aga gcc ctt gag cct tat ata age aga tgc act gtt tgt gaa ggt cct 3 84 

Arg Ala Leu Glu Pro Tyr lie Ser Arg Cys Thr Val Cys Glu Gly Pro 
115 120 125 

gcg ate gcc ata gcc gtt cac age caa acc act gac att cct cca tgt 432 

Ala lie Ala lie Ala val His'Ser Gin Thr Thr Asp lie Pro Pro Cys 

130 135 140 

cct cac ggc tgg att tct etc tgg aaa gga ttt tea ttc ate atg aaa 480 

Pro His Gly Trp lie Ser Leu Trp Lys Gly Phe Ser Phe lie Met Lys 
145 150 155 160 

gcc tat tec ate aac tgt gaa age tgg gga att aga aaa aat aat aag 528 

Ala Tyr Ser lie Asn Cys Glu Ser Trp Gly lie Arg Lys Asn Asn Lys 
165 170 175 

teg ctg tea ggt gtg cat gaa gaa aag aca ctg aag eta aaa aag aca 576 

Ser Leu Ser Gly Val His Glu Glu Lys Thr Leu Lys Leu Lys Lys Thr 
180 185 190 

gca gaa ctg eta ttt ttc ate eta aag aac aaa gta atg aca gaa cat 624 

Ala Glu Leu Leu Phe Phe lie Leu Lys Asn Lys Val Met Thr Glu His 
195 200 205 

get gtt att taggtatttt tctttaacca aacaatattg ctccatgatg 673 
Ala Val He 
210 



acttagtaca aa 



685 



<210> 46 
<211> 211 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDV 
<400> 46 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
1**5 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
2 0 25 3 0 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Gly Thr Leu 
50 55 60 

Gly Ser Cys Leu Gin Arg Phe Thr Thr Met Pro Phe Leu Phe Cys Asn 
65 70 75 80 

Val Asn Asp Val Cys Asn Phe Ala Ser Arg Asn Asp Tyr Ser Tyr Trp 
85 90 95 

Leu Ser Thr Pro Ala Leu Met Pro Met Asn Met Ala Pro He Thr Gly 
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100 105 110 

Arg Ala Leu Glu Pro Tyr lie Ser Arg Cys Thr Val Cys Glu Gly Pro 
115 120 125 

Ala lie Ala He Ala Val His Ser Gin Thr Thr Asp He Pro Pro Cys 
130 135 , 140 

Pro His Gly Trp He Ser Leu Trp Lys Gly Phe Ser Phe He Met Lys 
145 150 155 160 

Ala Tyr Ser He Asn Cys Glu Ser Trp Gly He Arg Lys Asn Asn Lys 
165 170 175 

Ser Leu Ser Gly Val His Glu Glu Lys Thr Leu Lys Leu Lys Lys Thr 
180 185 190 

Ala Glu Leu Leu Phe Phe He Leu Lys Asn Lys Val Met Thr Glu His 
195 200 205 

Ala Val He 
210 



. 2 11 > 680 
<2 12> DNA 

« 2 1 3 > Artificial Sequence 
<220> 

v223^ Description of Artificial Sequence: GPDIII 

*.220> 

O 2 1 > CDS 

v222- ll) . . (216) 

<400> 47 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 4 8 

31v Leu Lvs Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
; 5 10 15 



acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 
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tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gta caa gga aat caa cga gee cac gga caa gac ctt gat gca ctg 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

ttt gtg aag gtc ctg cga teg cca tagcegttea cagccaaacc actgacattc 246 
Phe Val Lys Val Leu Arg Ser Pro 
65 70 

ctccatgtcc teaeggctgg atttctctct ggaaaggatt ttcattcatc atgttcacaa 306 

gtgcaggttc tgagggcacc gggcaagcac tggcctcccc tggctcctgc ctggaagaat 366 
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tccgagccag cccatttcta gaatgtcatg gaagaggaac gtgcaactac tattcaaatt 426 

cctacagttt ctggctggct ccattaaacc cagaaagaat gttcagaaag cctattccat 486 

caactgtgaa agctggggaa ttagaaaaaa taataagtcg ctgtcaggtg tgcatgaaga 54 6 

aaagacactg aagctaaaaa agaca^caga actgctattt ttcatcctaa agaacaaagt 606 

aatgacagaa catgctgtta tttaggtatt tttctttaac caaacaatat tgctccatga 666 
tgacttagta caaa 680 

<210> 48 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII 
<400> 48 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
1 5 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

Phe Val Lys Val Leu Arg Ser Pro 
65 70 



<210> 49 
<211> 392 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : GPDIII-IV-V 

<220> 

<221> CDS 

<222> (1) . . (207) 

<400> 49 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 48 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
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35 40 45 

ttt gtacaa gga aat caa cga gcc cac gga caa gac ctt gaa age eta 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu 
50 55 60 

ttc cat caa ctg tga aagctgggga attagaaaaa ataataagtc gctgtcaggt 247 
Phe His Gin Leu 

gtgcatgaag aaaagacact gaagctaaaa aagacagcag aactgetatt tttcatccta 3 07 
aagaacaaag taatgacaga acatgctgtt atttaggtat ttttctttaa ccaaacaata 367 
ttgctccatg atgacttagt acaaa 3 92 



<210> 50 
<211> 68 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-IV-V 
<400> 50 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
2 0 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu 
50 55 60 

Phe His Gin Leu 
65 



<210> 51 
<211> 507 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-V 

<220> 

<221> CDS 

<222> (1) . . (216) 

<400> 51 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca ace tgg aca 48 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 

1 5 1*0 15 

acg aga ggc ttt gtc ttc ace cga cac agt caa ace aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
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20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gta caa gga aat caa cga gec cac gga caa gac ctt gat gca ctg 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

ttt gtg aag gtc ctg cga teg cca tagcegttea cagccaaacc actgacattc 246 
Phe Val Lys Val Leu Arg Ser Pro 
65 ~ 70 

ctccatgtcc teaeggctgg atttctctct ggaaaggatt ttcattcatc atgaaagect 306 

attccatcaa ctgtgaaagc tggggaatta gaaaaaataa taagtcgctg tcaggtgtgc 3 66 

atgaagaaaa gacactgaag ctaaaaaaga cagcagaact gctatttttc atcctaaaga 426 

acaaagtaat gacagaacat gctgttattt aggtattttt ctttaaccaa acaatattgc 486 

tccatgatga cttagtacaa a 507 



<210> 52 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-V 
<400> 52 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

Phe Val Lys Val Leu Arg Ser Pro 
65 70 



<210> 53 
<211> 659 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: HMBP-21 

<220> 

<221> CDS 

<222> (37) . . (627) 
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<400> 53 

gaaaacagtg cagccacctc cgagagcctg gatgtg atg gcg tea cag aag aga 54 

Met Ala Ser Gin Lys Arg 
1 5 

ccc tec cag agg cac gga tec ^ aag tac ctg gec aca gca agt acc atg 102 
Pro Ser Gin Arg His Gly Ser Lys Tyr Leu Ala Thr Ala Ser Thr Met 
10 15 20 

gac cat gec agg cat ggc ttc etc cca agg cac aga gac acg ggc ate 150 
Asp His Ala Arg His Gly Phe Leu Pro Arg His Arg Asp Thr Gly lie 
25 30 35 

ctt gac tec ate ggg cgc ttc ttt ggc ggt gac agg ggt gcg cca aag 198 
Leu Asp Ser lie Gly Arg Phe Phe Gly Gly Asp Arg Gly Ala Pro Lys 
40 45 50 

egg ggc tct ggc aag gta ccc tgg eta aag ccg ggc egg age cct ctg 24 6 
Arg Gly Ser Gly Lys Val Pro Trp Leu Lys Pro Gly Arg Ser Pro Leu 
55 60 65 70 

ccc tct cat gee cgc age cag cct ggg ctg tgc aac atg tac aag gac 294 
Pro Ser His Ala Arg Ser Gin Pro Gly Leu Cys Asn Met Tyr Lys Asp 
75 80 85 

tea cac cac ccg gca aga act get cac tat ggc tec ctg ccc cag aag 342 
Ser His His Pro Ala Arg Thr Ala His Tyr Gly Ser Leu Pro Gin Lys 
90 95 100 

tea cac ggc egg acc caa gat gaa aac ccc gta gtc cac ttc ttc aag 390 
Ser His Gly Arg Thr Gin Asp Glu Asn Pro Val Val His Phe Phe Lys 
105 110 115 

aac att gtg acg cct cgc aca cca ccc ccg teg cag gga aag ggg aga 43 8 
Asn lie Val Thr Pro Arg Thr Pro Pro Pro Ser Gin Gly Lys Gly Arg 
120 125 130 

gga ctg tec ctg age aga ttt age tgg ggg gee gaa ggc cag aga cca 486 
Gly Leu Ser Leu Ser Arg Phe Ser Trp Gly Ala Glu Gly Gin Arg Pro 
135 140 145 150 

gga ttt ggc tac gga ggc aga gcg tec gac tat aaa teg get cac aag 534 
Gly Phe Gly Tyr Gly Gly Arg Ala Ser Asp Tyr Lys Ser Ala His Lys 
155 160 165 

gga ttc aag gga gtc gat gee cag ggc acg ctt tec aaa att ttt aag 5 82 
Gly Phe Lys Gly Val Asp Ala Gin Gly Thr Leu Ser Lys lie Phe Lys 
170 175 180 



ctg gga gga aga gat agt cgc tct gga tea ccc atg get aga cgc 
Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser Pro Met Ala Arg Arg 
185 190 195 



627 



tgaaaaccca cctggttccg gaatcctgtc ct €59 



<210> 54 
<211> 197 
<212> PRT 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: HMBP-21 
<400> 54 

Met Ala Ser Gin Lys Arg Pro Ser Gin Arg His Gly Ser Lys Tyr Leu 
1 5 . 10 15 

Ala Thr Ala Ser Thr Met Asp His Ala Arg His Gly Phe Leu Pro Arg 
20 25 30 

His Arg Asp Thr Gly lie Leu Asp Ser lie Gly Arg Phe Phe Gly Gly 
35 40 45 

Asp Arg Gly Ala Pro Lys Arg Gly Ser Gly Lys Val Pro Trp Leu Lys 
50 55 60 

Pro Gly Arg Ser Pro Leu Pro Ser His Ala Arg Ser Gin Pro Gly Leu 
65 70 75 80 

Cys Asn Met Tyr Lys Asp Ser His His Pro Ala Arg Thr Ala His Tyr 
85 90 95 

Gly Ser Leu Pro Gin Lys Ser His Gly Arg Thr Gin Asp Glu Asn Pro 
100 105 ' 110 

Val Val His Phe Phe Lys Asn lie Val Thr Pro Arg Thr Pro Pro Pro 
115 120 125 

Ser Gin Gly Lys Gly Arg Gly Leu Ser Leu Ser Arg Phe Ser Trp Gly 
130 135 140 

Ala Glu Gly Gin Arg Pro Gly Phe Gly Tyr Gly Gly Arg Ala Ser Asp 
145 150 155 160 

Tyr Lys Ser Ala His Lys Gly Phe Lys Gly Val Asp Ala Gin Gly Thr 
165 170 175 

Leu Ser Lys lie Phe Lys Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser 
180 185 190 

Pro Met Ala Arg Arg 
195 
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Glu Leu Gin Arg 
195 

Thr Thr Arg Ser 
210 

Glu Lys Leu Phe 
225 

Phe Lys Gly Glu 



Ala Thr Leu Ser 
260 

Trp Gin Lys Arg 
275 

Glu Ala Tyr Lys 
290 

Gly Gly Pro Asp 
305 

Glu Phe Phe Asp 



Glu Glu Gin Ser 
340 

Leu Pro Ser Gly 
355 

Gin Lys Pro Tyr 
370 

Ser Ala Ser Asp 
385 

Val Gin Asn His 



Asn Trp Gin Leu 
420 

Glu Val Glu Glu 
435 

Ala Val Lys Gly 
450 

Val Asp Val Arg 
465 

Val Glu Thr Leu 



Arg Val Trp Pro 
500 

Arg Lys lie Pro 



Asp Lys Val Val 
200 

Asp Gly Asp Phe 
215 

Pro His Val Thr 
230 

Ala lie Thr Phe 
245 

His Cys He Glu 



Leu Asp Lys Glu 
280 

Asn Ala Met Thr 
295 

Tyr Glu Glu Gly 
310 

Ala Val Glu Ala 
325 

Gin Ser Glu Lys 



Asp Ala Phe Ser 
360 

Ser Arg Ser Ser 
375 

Asp Val His Arg 
390 

Met Thr Tyr Ser 
405 

Val Val Glu Glu 



Asn Gly He Val 
440 

Val Thr Gly His 
455 

Asn Asp Trp Glu 
470 

Ala Asp Asn Ala 
485 

Ala Ser Gin Arg 



Ala Leu Thr Glu 



Glu Asp Asp Glu 



Leu His Ser Thr 
220 

Pro Lys Gly He 
235 

Lys Ala Thr Thr 
250 

Leu Met Val Lys 
265 

Thr Glu Lys Lys 



Glu Leu Lys Lys 
300 

Pro Asn Ser Leu 
315 

Ala Leu Asp Arg 
330 

Val Arg Leu His 
345 

Ser Val Gly Thr 



Ser Met Ser Ser 
380 

Phe Ser Ser Gin 
395 

Leu Gin Asp Val 
410 

Gly Glu Met Lys 
425 

Leu Asp Pro Leu 



Glu Val Cys Asn 
460 

Thr Thr He Glu 
475 

He He He Tyr 
490 

Asp Val Leu Tyr 
505 

Asn Asp Pro Glu 



Asp Asp Phe Pro 
205 

Asn Gly Asn Lys 



Asn Gly He Asp 
240 

Ala Gly He Leu 
255 

Arg Glu Asp Ser 
270 

Arg Arg Thr Glu- 
285 

Lys Ser His Phe 



He Asn Glu Glu 
320 

Gin Asp Lys He 
335 

Trp Pro Thr Ser 
350 

His Arg Phe Val 
365 

He Asp Leu Val 



Val Glu Glu Met 
400 

Gly Gly Asp Ala 
415 

Val Tyr Arg Arg 
430 

Lys Ala Thr His 
445 

Tyr Phe Trp Asn 



Asn Phe His Val 
480 

Gin Thr His Lys 
495 

Leu Ser Val He 
510 

Thr Trp He Val 
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515 520 525 

Cys Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys lie Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 3 

v 2 1 1 > 2762 

v:::> dna 

«,213> Mus musculus 

O 2 0 > 

• 221 > CDS 

■ 222 > < 444 ) . . (2315) 

* 4 C 0 > 3 

cgggccacca cgtgtaaata gtatcggacc cggcaggaag atggcggctg tagcggaggt 60 

gtgagtgagt ggatctgggt ctctgccgtt ggcttggctc ttcccgtctt cctcccctcc 120 

tcccrccctg actgaggttg gcatctaggg ggccgagttc aggtggcggc gccgggcgca 180 

acgcaggggt cacggccacg gcggctgacg gctggaaggg caggctttct tcgccgctcg 240 

::c:cc:tc: ccggtccgct cggtgtcagg cgcggcggcg gcggcgcggc gggcgcgctt 300 

cgtccctctt cctgttccct cactccccgg agcgggctct cttggcggtg ccatcccccg 360 

acccttcacc ccagggacta ggcgcctgca ctggcgcagc tcgcggagcg ggggccggtc 420 

tcctgctcgg ctgtcgcgtc tec atg teg gat aac cag age tgg aac teg teg 473 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser 
15 10 

gge teg gag gag gat ccg gag acg gag tec ggg ccg cct gtg gag cgc 521 
Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val Glu Arg 
15 20 25 

tgc ggg gtc etc age aag tgg aca aac tat att cat gga tgg cag gat 569 
Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr He His Gly Trp Gin Asp 
3 0 3 5 4 0 

cgt tgg gta gtt ttg aaa aat aat act ttg agt tac tac aaa tct gaa 617 
Arg Trp Val Val Leu Lys Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu 
4 5 5 0 55 
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gat gaa aca gaa tat ggc tgt agg gga tec ate tgt ctt age aag get 
Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser lie Cys Leu Ser Lys Ala 
60 65 70 



665 



gtg ate acg cct cac gat ttt gat gaa tgc egg ttt gat ate agt gta 713 

Val lie Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp lie Ser Val 
75 80 85 90 

aat gat agt gtt tgg tac ctt cga get cag gac ccg gag cac aga cag 761 

Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Glu His Arg Gin 

95 100 105 



caa tgg gta gac gee att gaa cag cac aag act gaa teg gga tat gga 
Gin Trp Val Asp Ala lie Glu Gin His Lys Thr Glu Ser Gly Tyr Gly 
110 115 120 



809 



tct gag tec age ttg cgt aga cat ggc tea atg gtg tea ctg gtg tct 857 
Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu Val Ser 
125 130 135 

gga gcg agt ggc tat tct get acg tec ace tct tct ttc aag aaa ggc 905 
Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly 
14 0 14 5 150 

cac agt tta cgt gag aaa ctg get gaa atg gag aca ttt egg gac ate 953 
His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg Asp lie 
155 160 165 ~ " 170 

ctg tgc egg cag gtt gat act etc cag aag tac ttt gat gtc tgt get 1001 
Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Val Cvs Ala 
175 180 ^ 185 

gac get gtc tec aag gat gag ctt cag agg gat aaa gtc gta gaa gat 1049 
Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val Glu Asp 
190 195 200 

gat gaa gat gac ttc cct aca act cgt tct gat gga gac ttt ttg cac 1097 
Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe Leu His 
205 210 215 

aat ace aat ggt aat aaa gaa aaa tta ttt cca cat gta aca cca aaa 1145 
Asn Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr Pro Lys 
220 225 230 

gga att aat ggc ata gac ttt aaa ggg gaa gca ata act ttt aaa gca 1193 
Gly lie Asn Gly lie Asp Phe Lys Gly Glu Ala lie Thr Phe Lys Ala 
235 240 245 250 

act act get gga ate ctt get aca ctt tct cat tgt att gaa tta atg 1241 
Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu Leu Met 
255 260 265 

gta aaa egg gaa gag age tgg caa aaa aga cac gat agg gaa gtg gaa 12 8 9 
Val Lys Arg Glu Glu Ser Trp Gin Lys Arg His Asp Arg Glu Val Glu 
270 275 280 

aag agg aga cga gtg gag gaa gcg tac aag aat gtg atg gaa gaa ctt 13 3 7 
Lys Arg Arg Arg Val Glu Glu Ala Tyr Lys Asn Val Met Glu Glu Leu 
285 290 295 

aag aag aaa ccc cgt ttc gga ggg ccg gat tat gaa gaa ggt cca aac 13 85 
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Lys Lys Lys Pro Arg Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
300 305 310 

agt ctg att aat gag gaa gag ttc ttt gat get gtt gaa get get ctt 

Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
315 320 325 330 

gac aga caa gat aaa ata gag gaa cag tea cag agt gaa aag gtc agg 

Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 

335 340 345 



ggg acg cat aga ttt gta caa aag ccc tat agt cgc tct tec tec atg 
Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met 
365 370 375 



tec cag gtt gaa gaa atg gta cag aac cac atg aac tat tea tta cag 
Ser Gin Val Glu Glu Met Val Gin Asn His Met Asn Tyr Ser Leu Gin 
395 400 405 410 



atg aag gta tac aga aga gaa gtg gaa gaa aat gga att gtt ctg gat 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
430 435 440 



1433 



1481 



tta cac tgg ccc aca tea ttg cca tct gga gac acc ttt tct tct gtc 1529 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Thr Phe Ser Ser Val 
350 355 360 



1577 



tct tec att gat eta gtc agt gee tct gac gat gtt cac aga ttc age 1625 
Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His Arg Phe Ser 
380 385 390 



tgc aat tac ttt tgg aat gtt gat gtt cgc aat gac tgg gaa act act 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
460 465 470 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 
Tie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
475 480 485 490 

gtt tat caa acg cac aag aga gta tgg ccc get tct cag aga gac gta 
Val Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
495 500 505 

ctg tat ctt tct get att cga aag ate cca gee ttg act gaa aat gat 
Leu Tyr Leu Ser Ala lie Arg Lys lie Pro Ala Leu Thr Glu Asn Asp 
510 515 520 

cct gaa act tgg ata gtt tgt aat ttt tct gtg gat cat gat agt get 
Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
525 530 535 

cct ctg aac aat cga tgt gtc cgt gee aaa ate aat att get atg att 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn lie Ala Met He 



1673 



gat gta ggt ggt gat gca aat tgg caa ctg gtt gtt gaa gaa gga gaa 1721 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
415 420 425 



1769 



cct ttg aaa get act cat gca gtt aaa ggt gtt aca gga cat gag gtc 1817 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
445 450 455 



1865 



1913 



1961 



2009 



2057 



2105 
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540 545 550 

tgt caa act: tta gta age cca cca gag gga gac cag gag ata age aga 2153 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asp Gin Glu lie Ser Arg 
555 560 565 570 

gac aac att ctg tgc aag ate acg tat gta get aat gtg aac cca gga 2201 
Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
575 580 585 

gga tgg gcg cca get teg gtc tta aga gca gtg gca aag cga gaa tac 224 9 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
590 595 600 

cct aag ttt eta aaa cgt ttt act tct tat gtc caa gaa aaa act gca 2297 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
605 610 615 

gga aaa cca att ttg ttt tagtattaac agtgactgaa geaaggctge 2345 
Gly Lys Pro lie Leu Phe 
620 

gtgacgttcc atgttggaga aaggagggaa aaaataaaaa gaatcctcta agctggaacg 2405 

taggatctac agecttgtet gtggcccaag aagaaacatt geaategtaa agctgggtat 2465 

ccagcactag ccatctcctg ctaggcctcc tcgctcagcg tgtaactata aatacatgta 2525 

gaatcacatg gatatggcta tatttttatt tgcttgctcc ttggagtgaa aacaaataac 2585 

tttgaattac aactaggaat taaccgatgc tttaattttg aggaactttt tcagaatttt 2645 

ttatttacca tggtccaacc taagatcctc agttgtatca agtttttgtg cacaaaagaa 2705 

aagcacaaaa gttgaacgea cctgaaggca tgtgctctct gtgcaacaaa tactcag 2762 

<210> 4 
<211> 624 
<212> PRT 

<213> Mus musculus 
<400> 4 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 
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Leu Arg Ala Gin Asp Pro Glu His Arg Gin Gin Trp Val Asp Ala He 
100 105 no 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 

Phe Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Glu Ser 
260 265 270 

Trp Gin Lys Arg His Asp Arg Glu Val Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Val Met Glu Glu Leu Lys Lys Lys Pro Arg Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 350 

Leu Pro Ser Gly Asp Thr Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val 
370 375 380 

Ser Ala Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met 
385 390 395 400 

Val Gin Asn His Met Asn Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
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420 425 430 

Glu Val Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val 
465 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala He He Val Tyr Gin Thr His Lys 
485 490 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

Arg Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

Cys Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys lie Asn He Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

Pro Pro Glu Gly Asp Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 5 
<211> 2361 
<212> DNA 
<213> Bos taurus 

<220> 
<221> CDS 

<222> (421) . . (2292) 
<400> 5 

cggcaggaag atggcggcct agcggaggtg tgagtggacc tgggtctctg cagctgggtt 60 
ttccctcttc ccgtctttct cctcttttcc tctcccccga ggttggcatc gagggggcca 120 
aattcgggcg gcggcgccgg gcgcagcgca ggggtcacaa cgacggcgac ggctgacggt 180 
tggaagggca ggctcccttc gcccctcgac ctccttcccc ggtccgcttg gtgtcaggcg 240 
cggcggcggc ggcggcggcg gcgcggcggg cggactccat ccctcctccc gctccctcct 300 
gcaccggagc gggcactcct tccttcgcca tcccccgacc cttcaccccg gggactgggc 360 
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gcctccaccg gcgcagctca gggagcgggg gccggtctcc tgctcggctg tcgcgcctcc 420 

atg teg gat aac cag age tgg aac teg teg ggc teg gag gag gat ccg 468 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

gag acg gag tec ggg ccg ccg gtg gag cgc tgc gga gtc etc aac aag 516 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 



tgg aca aac tat att cat ggg tgg cag gat cgc tgg gta gtt ttg aaa 
Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 



564 



aat aac act ctg agt tac tac aaa tct gaa gat gag aca gag tat ggc 612 
Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 



tgc aga gga tec ate tgt ctt age aag get gtc ate acg cct cat gat 
Cys Arg Gly Ser lie Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 " * 70 75 80 

ttt gat gaa tgc cga ttt gat att agt' gta aat gat agt gtt tgg tat 
Phe Asp Glu Cvs Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 



660 



708 



ctt cgt get caa gat cca gat cac aga cag cag tgg ata gat gee att 756 
Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 HO 



gaa cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt 
Glu Gin His Lvs Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 " 120 125 

cga cat ggc tec atg gta tea ttg gta tec gga gca agt ggc tat tct 
Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

gca aca tec ace tec tea ttc aag aag ggc cac agt tta cgt gag aaa 
Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

ctg get gaa atg gaa ace ttt aga gat ata ctg tgt aga caa gtt gat 
Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 175 

acc eta cag aag ttc ttt gat gee tgt get gat get gtc tec aag gat 
Thr Leu Gin Lvs Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

gaa ttt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct 
Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 ~ ' 200 205 

acg aca cgt tct gat gga gac ttc ttg cat aat acc aat ggc aat aag 
Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 22 0 

gaa aag gta ttt cca cat gta aca cca aaa gga att aat ggt ata gac 
Glu Lys Val Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 



804 



852 



900 



948 



996 



1044 



1092 



1140 
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aaa ggt 
Phe Lys Giy 



get aca ctt 
Ala Thr Leu 



tag caa aag 

Trp Gin Lys 
275 

gaa gca tac 

Glu Ala Tyr 
290 

gga gga cca 

Gly Giy Pro 
305 

gag ::c 
Git- Pr.e Pne 



gaa gaa cag 

Glu Glu Gin 



atg cca tct 
Met Pre ^er 

25S 

caa aaq ccc 
Gin Lyc Pro 

age gec tct 
Ser A: a i>r 

at g cag aac 
Val Gin Asn 



aac tgg cag 
Asn Trp Gin 



gaa at j aaa 

Glu Val CI -J 
435 

gca get aaa 
Ala Val Lys 
450 

gtt gat ate 

Va 1 As t Va 1 
465 



gag gcg ata 
Glu Ala He 
245 

tct cat tgt 
Ser His Cys 
260 

aga atg gac 
Arg Met Asp 



aaa aat gec 
Lys Asn Ala 



gat tat gag 
Asp Tyr Glu 
310 

gat get gtt 
Asp Ala Val 
325 

teg cag agt 
Ser Gin Ser 
340 

gga gat gec 
Giy Asp Ala 



tat agt cgc 
Tyr Ser Arg 



gac ggt gtt 
Asp Gly Val 
390 

cac atg acc 
His Met Thr 
405 

ttg gtt gta 
Leu Val Val 
420 

gaa aat ggg 
Glu Asn Gly 



ggc gtt aca 
Gly Val Thr 



cgc aat gat 
Arg Asn Asp 
470 



act ttt aaa 
Thr Phe Lys 



att gag ctg 
He Glu Leu 
265 

aag gaa act 
Lys Glu Thr 
280 

atg aca gaa 
Met Thr Glu 
295 

gaa ggc cca 
Glu Gly Pro 



gaa get get 
Glu Ala Ala 



gaa aag gtc 
Glu Lys Val 
345 

ttt tct tct 
Phe Ser Ser 
360 

tct tec tec 
Ser Ser Ser 

375 

cac aga ttc 
His Arg Phe 



tat tea ttg 
Tyr Ser Leu 



gaa gaa ggg 
Glu Glu Gly 
425 

att gtt ctg 
He Val Leu 
440 

gga cac gag 

Gly His Glu 
455 

tgg g^a aca 

Trp Glu Thr 



gca act act 
Ala Thr Thr 
250 

atg gta aaa 
Met Val Lys 



gag aag aga 
Glu Lys Arg 



ctt aag aaa 
Leu Lys Lys 
300 

aac agt ttg 
Asn Ser Leu 
315 

ctt gac aga 
Leu Asp Arg 
330 

agg tta cat 
Arg Leu His 



gtg ggg act 

Val Gly Thr 



atg tct tec 
Met Ser Ser 
380 

age tec cag 
Ser Ser Gin 
395 

cag gat gta 
Gin Asp Val 
410 

gag atg aag 
Glu Met Lys 



gat cct ttg 
Asp Pro Leu 



gtc tgc aat 
Val Cys Asn 
460 

act ata gaa 
Thr He Glu 
475 



gee gga ate 
Ala Gly He 
255 

cgt gag gac 
Arg Glu Asp 
270 

aga aga gtg 
Arg Arg Val 
285 

aaa tec cac 
Lys Ser His 



att aat gaa 
He Asn Glu 



caa gat aaa 
Gin Asp Lys 

335 

tgg tct act 
Trp Ser Thr 
350 

cat aga ttt 

His Arg Phe 
365 

att gat eta 

He Asp Leu 



gtt gaa gag 
Val Glu Glu 



ggt ggg gac 
Gly Gly Asp 
415 

gta tat aga 
Val Tyr Arg 
430 

aaa get acc 

Lys Ala Thr 
445 

tac ttc tgg 

Tyr Phe Trp 



aac ttt cat 
Asn Phe His 



ctt 1188 
Leu 



age 1236 
Ser 



gag 1284 
Glu 



ttt 1332 
Phe 



gag 1380 

Glu 

320 

ata 1428 
He 



tea 1476 
Ser 



gtc 1524 
Val 



gtc 1572 
Val 



atg 1620 

Met 

400 

gee 1668 
Ala 



aga 1716 
Arg 



cat 1764 
His 



aat 1812 
Asn 



gtg 1860 

Val 

480 
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gtg gaa aca tta get gat aat gca ate ate att tat caa acg cac aag 1908 
Val Glu Thr Leu Ala Asp Asn Ala lie lie He Tyr Gin Thr His Lys 
485 490 495 

aga gtg tgg cca gee tct cag egg gat gtc tta tat ctg tct gec att 1956 
Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala He 
500 505 510 

cga aag ata cca get ttg aat gaa aat gac ccg gag act tgg ata gtt 2004 
Arg Lys He Pro Ala Leu Asn Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

tgt aat ttt tct gta gat cac age agt get cct eta aac aat cga tgt 2 052 
Cys Asn Phe Ser Val Asp His Ser Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

gtc cgt gee aaa ata aac gtt get atg att tgt cag acc ttg gtg age 2100 
Val Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 550 555 560 

ccc cca gag gga aac cag gag att age agg gac aac att eta tgc aag 2148 
Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

att aca tac gtg gee aat gta aac cct gga gga tgg gec cca gee tea 2196 
He Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

gtg tta egg gca gtg gca aag cga gaa tat cca aag ttt eta aag cgt 2244 
Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

ttt act tct tac gta caa gaa aaa act gca gga aaa cct att ttg ttc 2292 
Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 

tagtattaac agtgactgaa gcaaggctgt gtgacattcc atgttggagg aaaaaaaaaa 2352 

aaaaaaaaa 2361 



<210> 6 
<211> 624 
<212> PRT 
<213> Bos taurus 

<400> 6 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 



14 



OOCID <WO 0050607A2_I_> 



WO 00/50607 



PCT/IB00/00324 



Phe Asp Glu Cys 



Leu Arg Ala Gin 
100 

Glu Gin His Lys 
115 

Arg His Gly Ser 
130 

Ala Thr Ser Thr 
145 

Leu Ala Glu Met 



Thr Leu Gin Lys 
180 

31 u Phe Gin Arg 
195 

Thr Thr Arg Ser 
210 



Arg Phe Asp He 
85 

Asp Pro Asp His 



Thr Glu Ser Gly 
120 

Met Val Ser Leu 
135 

Ser Ser Phe Lys 
150 

Glu Thr Phe Arg 
165 

Phe Phe Asp Ala 



Asp Lys Val Val 
200 

Asp Gly Asp Phe 
215 



Ser Val Asn Asp 
90 

Arg Gin Gin Trp 
105 

Tyr Gly Ser Glu 



Val Ser Gly Ala 
140 

Lys Gly His Ser 
155 

Asp He Leu Cys 
170 

Cys Ala Asp Ala 
185 

Glu Asp Asp Glu 



Leu His Asn Thr 
220 



Ser Val Trp Tyr 
95 

He Asp Ala He 
110 

Ser Ser Leu Arg 
125 

Ser Gly Tyr Ser 



Leu Arg Glu Lys 
160 

Arg Gin Val Asp 
175 

Val Ser Lys Asp 
190 

Asp Asp Phe Pro 
205 

Asn Gly Asn Lys 



G.u Lyc Val Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 

?he Lvc Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

A At i Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 " 270 

Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 



Ala Tyr Lys Asn Ala Met Thr Glu Leu Lvs Lys Lys Ser His Phe 

2*0 295 ' 300 

G2v Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 

305 310 315 320 

Glu Pne Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 

325 330 335 

Glu Clu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 

340 345 350 

Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 

355 360 365 

Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val 

370 375 380 

Ser Ala Ser Asp Gly Val His Arg Phe Ser Ser Gin Val Glu Glu Met 

385 390 395 400 
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Val Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala 
405 410 415 

Asn Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg 
420 425 430 

Glu Val Glu Glu Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His 
435 440 445 

Ala Val Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn 
450 455 460 

Val Asp Val Arg Asn Asp Trp Glu Thr Thr lie Glu Asn Phe His Val 
465 470 475 480 

Val Glu Thr Leu Ala Asp Asn Ala lie lie lie Tyr Gin Thr His Lys 
485 490 495 

Arg Val Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala lie 
500 505 510 

Arg Lys He Pro Ala Leu Asn Glu Asn Asp Pro Glu Thr Trp He Val 
515 520 525 

Cys Asn Phe Ser Val Asp His Ser Ser Ala Pro Leu Asn Asn Arg Cys 
530 535 540 

Val Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser 
545 ™ 550 555 560 

Pro Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys 
565 570 575 

lie Thr Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser 
580 585 590 

Val Leu Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg 
595 600 605 

Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 620 



<210> 7 

<211> 2187 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (391) . . (2187) 
<400> 7 

tagcggaggt gtgagtggac gcgggactca gcggccggat tttctcttcc cttcttttcc 60 
cttttccttc cctatttgaa attggcatcg agggggctaa gttcgggtgg cagcgccggg 120 
cgcaacgcag gggtcacggc gacggcggcg gcggctgacg gctggaaggg taggcttcat 18 0 
tcaccgctcg tcctccttcc tcgctccgct cggtgtcagg cgcggcggcg gcgcggcggg 24 0 



16 



5DOCID <WO 0050607A2 l_> 



WO 00/50607 



PCT/IBOO/00324 



cggacttcgt ccctcctcct gctccccccc acaccggagc gggcactctt cgcttcgcca 300 

tcccccgacc cttcaccccg aggactgggc gcctcctccg gcgcagctga gggagcgggg 360 

gccggtctcc tgctcggttg tcgagcctcc atg teg gat aat cag age tgg aac 414 

Met Ser Asp Asn Gin Ser Trp Asn 
1 5 

teg teg ggc teg gag gag gat cca gag acg gag tct ggg ccg cct gtg 462 
Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val 
10 15 20 

gag cgc tgc ggg gtc etc agt aag tgg aca aac tac att cat ggg tgg 510 
Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr lie His Gly Trp 
25 30 35 40 

cag gat cgt tgg gta gtt ttg aaa aat aat get ctg agt tac tac aaa 558 
Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala Leu Ser Tyr Tyr Lys 
45 50 55 

tct gaa gat gaa aca gag tat ggc tgc aga gga tec ate tgt ctt age 606 
Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser lie Cys Leu Ser 
60 65 70 

aag get gtc ate aca cct cac gat ttt gat gaa tgt cga ttt gat att 654 
Lys Ala Val lie Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp lie 
75 80 85 

agt gta aat gat agt gtt tgg tat ctt cgt get cag gat cca gat cat 702 
Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Asp His 
90 95 100 

aga cag caa tgg ata gat gee att gaa cag cac aag act gaa tct gga 750 
Arg Gin Gin Trp lie Asp Ala lie Glu Gin His Lys Thr Glu Ser Gly 
105 110 115 120 

tat gga tct gaa tec age ttg cgt cga cat ggc tea atg gtg tec ctg 798 
Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu 
125 130 135 

gtg tct gga gca agt ggc tac tct gca aca tec ace tct tea ttc aag 84 6 
Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys 
140 145 150 

aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca ttt aga 894 
Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
155 160 165 

gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt gat gee 942 
Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
170 175 180 

tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa gtg gta 990 
Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
185 190 195 200 

gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt gac ttc 103 8 
Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
205 210 215 

ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat gtg aca 1086 
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Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
220 225 230 

cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata act ttt 1134 
Pro Lys Gly lie Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe 
235 240 245 

aaa gca act act get gga ate ctt gca aca ctt tct cat tgt att gaa 1182 
Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys He Glu 
250 255 260 

eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat aag gaa 1230 
Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
265 270 275 280 

act gag aag aaa aga aga aca gag gaa gca tat aaa aat gca atg aca 1278 
Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala Met Thr 
285 290 295 

gaa ctt aag aaa aaa tec cac ttt gga gga cca gat tat gaa gaa ggc 1326 
Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu Glu Gly 
300 305 310 

cc: aac agt ctg att aat gaa gaa gag ttc ttt gat get gtt gaa get 1374 
Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala 
315 320 325 

a:: ctt gac aga caa gat aaa ata gaa gaa cag tea cag agt gaa aag 1422 
Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys 
330 335 340 

gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat gee ttt tct 1470 
Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser 
345 350 355 360 

♦rt gtg ggg aca cat aga ttt gtc caa aag gtt gaa gag atg gtg cag 1518 
Ser Val Gly Thr His Arg Phe Val Gin Lys Val Glu Glu Met Val Gin 
365 370 375 

i»ac cac atg act tac tea tta cag gat gta ggc gga gat gee aat tgg 1566 
Ai:n K;s Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp 
380 385 390 

cag ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta 1614 
Clr. Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val 
395 400 405 

gaa gaa aat ggg att gtt ctg gat cct tta aaa get acc cat gca gtt 1662 
Glu Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val 
41C 415 420 

aaa ggc gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac 1710 
Lys Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp 
425 430 435 440 

gtt cgc aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa 1758 
Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu 
445 450 455 

aca tta get gat aat gca ate ate att tat caa aca cac aag agg gtg 1806 
Thr Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val 
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460 465 470 

tgg cct get tct cag cga gac gta tta tat ctt tct gtc att cga aag 1854 

Trp Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val lie Arg Lys 
475 480 485 

ata cca gcc ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat 1902 

lie Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn 

490 495 500 

ttt tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc cgt 1950 

Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg 

505 510 515 520 

gcc aaa ata aat gtt get atg att tgt caa acc ttg gta age cca cca 1998 

Ala Lys lie Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro 
525 530 535 

gag gga aac cag gaa att age agg gac aac att eta- tgc aag att aca 2 04 6 

Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr 

540 545 ' 550 

tat gta get aat gtg aac cct gga gga tgg gca cca gcc tea gtg tta 2 094 

Tyr Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu 
555 560 565 

agg gca gtg gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act 2142 

Arg Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr 

570 575 580 

tct tac gtc caa gaa aaa act gca gga aag cct ant ttg ttc tag 2187 

Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 

585 590 595 



<210> 8 

<211> 598 

<212> PRT 

< 2 1 3 > Homo s ap i ens 

<400> 8 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

Asn Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 

Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 110 



19 



SDOCtD: <WO 00S0607A2J_> 



WO 00/50607 



PCT/IB00/00324 



Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 * ~ 175 

Thr Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 

Phe Lvs Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu 
245 250 255 

Ala Thr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu 
305 " 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser 
340 345 350 

Leu Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lvs Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
405 410 415 

Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 
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Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Le'u Tyr Leu Ser Val lie Arg Lys lie Pro Ala Leu Thr Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn Val Ala Met He 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 * 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro He Leu Phe 
5 95 



<210> 9 
<211> 2684 
<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 

<222> (444) . . (2237) 
<400> 9 

cgggccacca cgtgtaaata gtatcggacc cggcaggaag atggcggctg tagcggaggt 60 

gtgagtgagt ggatctgggt ctctgccgtt ggcttggctc ttcccgtctt cctcccctcc 120 

cccctccctg actgaggttg gcatctaggg ggccgagttc aggtggcggc gccgggcgca 180 

gcgcaggggt cacggccacg gcggctgacg gctggaaggg caggctttct tcgccgctcg 240 

tcctccttcc ccggtccgct cggtgtcagg cgcggcggcg gcggcgcggc gggcgcgctt 3 00 

cgtccctctt cctgttccct cactccccgg agcgggctct cttggcggtg ccatcccccg 360 

acccttcacc ccagggacta ggcgcctgca ctggcgcagc tcgcggagcg ggggccggtc 420 

tcctgctcgg ctgtcgcgtc tec atg teg gat aac cag age tgg aac teg teg 473 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser 
15 10 
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ggc teg gag gag gat ccg gag acg gag tec ggg ccg cct gtg gag cgc 521 
Gly Ser Glu Glu Asp Pro Glu Thr Glu Ser Gly Pro Pro Val Glu Arg 
15 20 25 

tgc ggg gtc etc age aag tgg aca aac tat att cat gga tgg cag gat 56 9 
Cys Gly Val Leu Ser Lys Trp Thr Asn Tyr lie His Gly Trp Gin Asp 
30 35 40 

cgt tgg gta gtt ttg aaa aat aat act ttg agt tac tac aaa tct gaa 
Arg Trp Val Val Leu Lys Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu 
45 50 55 

gat gaa aca gaa tat ggc tgt agg gga tec ate tgt ctt age aag get 
Asp Glu Thr Glu Tyr Gly Cys Arg Gly Ser lie Cys Leu Ser Lys Ala 

60 " r 5 70 

gtg ate acg cct cac gat ctt gat gaa tgc egg ttt gat ate agt gta 713 
Val lie Thr Pro His Asp Phe Asp Glu Cys Arg Phe Asp lie Ser Val 
75 80 85 90 

aat gat agt gtt tgg tac ctt cga get cag gac ccg gag cac aga cag 761 
Asn Asp Ser Val Trp Tyr Leu Arg Ala Gin Asp Pro Glu His Arg Gin 
95 100 105 



caa tgg gta gac gee att gaa cag cac aag act gaa teg gga tat gga 
Gin Trp Val Asp Ala He Glu Gin His Lys Thr Glu Ser Gly Tyr Gly 
110 115 120 

tct gag tec age ttg cgt aga cat ggc tea atg gtg tea ctg gtg tct 
Ser Glu Ser Ser Leu Arg Arg His Gly Ser Met Val Ser Leu Val Ser 
125 130 135 

gga gcg agt ggc tat tct get acg tec acc tct tct ttc aag aaa ggc 
Gly Ala Ser Gly Tyr Ser Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly 
140 145 150 

cac agt tta cgt gag aaa ctg get gaa atg gag aca ttt egg gac ate 
His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg Asp He 
155 160 165 170 

ctg tgc egg cag gtt gat act etc cag aag tac ttt gat gtc tgt get 
Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Val Cys Ala 
175 180 185 

gac get gtc tec aag gat gag ctt cag agg gat aaa gtc gta gaa gat 
Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val Glu Asp 
190 195 200 

gat gaa gat gac ttc cct aca act cgt tct gat gga gac ttt ttg cac 
Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe Leu His 
205 210 215 



617 



665 



809 



857 



905 



953 



1001 



1049 



1097 



aat acc aat ggt aat aaa gaa aaa tta ttt cca cat gta aca cca aaa 1145 
Asn Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr Pro Lys 
220 225 230 



gga att aat ggc ata gac ttt aaa ggg gaa gca ata act ttt aaa gca 
Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe Lys Ala 
235 240 245 250 



1193 
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act act get gga ate ctt get aca ctt tct cat tgt att gaa tta atg 1241 
Thr Thr Ala Gly lie Leu Ala Thr Leu Ser His Cys lie Glu Leu Met 
255 260 265 

gta aaa egg gaa gag age tgg caa aaa aga cac gat agg gaa gtg gaa 12 8 9 
Val Lys Arg Glu Glu Ser Trp Gin Lys Arg His Asp Arg Glu Val Glu 
270 275 ~ 280 

aag agg aga cga gtg gag gaa gcg tac aag aat gtg atg gaa gaa ctt 1337 
Lys Arg Arg Arg Val Glu Glu Ala Tyr Lys Asn Val Met Glu Glu Leu 
285 290 295 

aag aag aaa ccc cgt ttc gga ggg ccg gat tat gaa gaa ggt cca aac 1385 
Lys Lys Lys Pro Arg Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
300 305 310 

agt ctg att aat gag gaa gag ttc ttt gat get gtt gaa get get ctt 1433 
Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
315 320 325 330 

gac aga caa gat aaa ata gag gaa cag tea cag agt gaa aag gtc agg 1481 
Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
335 340 345 

tta cac tgg ccc aca tea ttg cca tct gga gac ace ttt tct tct gtc 1529 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Thr Phe Ser Ser Val 
350 355 * 360 

ggg acg cat aga ttt gta caa aag gtt gaa gaa atg gta cag aac cac 1577 
Gly Thr His Arg Phe Val Gin Lys Val Glu Glu Met Val Gin Asn His 
365 370 375 

atg aac tat tea tta cag gat gta ggt ggt gat gca aat tgg caa ctg 162 5 
Met Asn Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu 
380 385 " 390 

gtt gtt gaa gaa gga gaa atg aag gta tac aga aga gaa gtg gaa gaa 1673 
Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu 
395 400 405 410 

aat gga att gtt ctg gat cct ttg aaa get act cat gca gtt aaa ggt 1721 
Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly 
415 420 425 

gtt aca gga cat gag gtc tgc aat tac ttt tgg aat gtt gat gtt cgc 1769 
Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg 
430 435 44*0 

aat gac tgg gaa act act ata gaa aac ttt cat gtg gtg gaa aca tta 1817 
Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu 
445 450 455 

get gat aat gca ate ate gtt tat caa acg cac aag aga gta tgg ccc 1865 
Ala Asp Asn Ala He He Val Tyr Gin Thr His Lys Arg Val Trp Pro 
460 465 470 

get tct cag aga gac gta ctg tat ctt tct get att cga aag ate cca 1913 
Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Ala lie Arg Lys He Pro 
475 480 485 490 

gec ttg act gaa aat gat cct gaa act tgg ata gtt tgt aat ttt tct 1961 
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Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser 
495 500 505 

gtg gat cat gat agt get cct ctg aac aat cga tgt gtc cgt gec aaa 

Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys 
510 515 520 



2009 



ate aat att get atg att tgt caa act tta gta age cca cca gag gga 2057 
lie Asn lie Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly 
525 530 535 



gac cag gag ata age aga gac aac att ctg tgc aag ate acg tat gta 
Asp Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val 
540 545 550 

get aat gtg aac cca gga gga tgg gcg cca get teg gtc tta aga gca 
Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala 
555 560 565 570 

gtg gca aag cga gaa tac cct aag ttt eta aaa cgt ttt act tct tat 
Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr 
575 580 585 



2105 



2153 



2201 



gtc caa gaa aaa act gca gga aaa cca att ttg ttt tagtattaac 2247 
Val Gin Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
590 595 

agtgactgaa geaaggctge gtgacgttcc atgttggaga aaggagggaa aaaataaaaa 2307 

gaatcctcta agctggaacg taggatctac agecttgtet gtggcccaag aagaaacatt 2367 

geaategtaa agctgggtat ccagcactag ccatctcctg ctaggcctcc tcgctcagcg 2427 

tgtaactata aatacatgta gaatcacatg gatatggcta tatttttatt tgcttgctcc 2487 

ttggagtgaa aacaaataac tttgaattac aactaggaat taaccgatgc tttaattttg 2547 

aggaactttt tcagaatttt ttatttacca tggtccaacc taagatcctc agttgtatca 2607 

agtttttgtg cacaaaagaa aagcacaaaa gttgaacgea cctgaaggca tgtgctctct 2667 

gtgcaacaaa tactcag 2684 



<210> 10 

<211> 598 

<212> PRT 

<213> Mus . musculus 

<400> 10 

Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
15 10 15 

Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys 
20 25 30 

Trp Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
3 5 4 0 4 5 

Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 55 60 
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Cys Arg Gly Ser 
65 

Phe Asp Giu Cys 



Leu Arg Ala Gin 
100 

Glu Gin His Lys 
115 

Arg His Gly Ser 
130 

Ala Thr Ser Thr 
145 

Leu Ala Glu Met 



Thr Leu C i r. Lys 
180 

Glu Leu Gin Arg 

1 95 

Thr Tr.r Arg Ser 
210 

Glu Ly.: Leu Phe 
225 

Phe Lys Gly Glu 



Ala Thr Leu Ser 

260 

Trp Gin L/C Arg 



Glu Ala Tyr Lys 
2 90 

Gly Gly Pro Asp 
305 

Glu Phsr Pr.f- Asp 



Glu Glu Gin Ser 
340 

Leu Pro Ser Gly 

Gin Lye \ al Glu 
27Q 



lie Cys Leu Ser 
70 

Arg Phe Asp lie 
85 

Asp Pro Glu His 



Thr Glu Ser Gly 
120 

Met Val Ser Leu 
135 

Ser Ser Phe Lys 
150 

Glu Thr Phe Arg 
165 

Tyr Phe Asp Val 



Asp Lys Val Val 
200 

Asp Gly Asp Phe 
215 

Pro His Val Thr 
230 

Ala lie Thr Phe 
245 

His Cys lie Glu 



His Asp Arg Glu 
280 

Asn Val Met Glu 
295 

Tyr Glu Glu Gly 
310 

Ala Val Glu Ala 
325 

Gin Ser Glu Lys 



Asp Thr Phe Ser 
360 

Glu Met Val Gin 
375 



Lys Ala Val lie 
75 

Ser Val Asn Asp 
90 

Arg Gin Gin Trp 
105 

Tyr Gly Ser Glu 



Val Ser Gly Ala 
140 

Lys Gly His Ser 
155 

Asp lie Leu Cys 
170 

Cys Ala Asp Ala 
185 

Glu Asp Asp Glu 



Leu His Asn Thr 

220 

Pro Lys Gly lie 
235 

Lys Ala Thr Thr 
250 

Leu Met Val Lys 
265 

Val Glu Lys Arg 



Glu Leu Lys Lys 
300 

Pro Asn Ser Leu 
315 

Ala Leu Asp Arg 
330 

Val Arg Leu His 
345 

Ser Val Gly Thr 



Asn His Met Asn 
380 



Thr Pro His Asp 
80 

Ser Val Trp Tyr 
95 

Val Asp Ala lie 
110 

Ser Ser Leu Arg 
125 

Ser Gly Tyr Ser 



Leu Arg Glu Lys 
160 

Arg Gin Val Asp 
175 

Val Ser Lys Asp 
190 

Asp Asp Phe Pro 
205 

Asn Gly Asn Lys 

Asn Gly lie Asp 
240 

Ala Gly lie Leu 
255 

Arg Glu Glu Ser 
270 

Arg Arg Val Glu 
285 

Lys Pro Arg Phe 



lie Asn Glu Glu 
320 

Gin Asp Lys lie 
335 

Trp Pro Thr Ser 
350 

His Arg Phe Val 
365 

Tyr Ser Leu Gin 
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Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
405 410 415 

Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

Val Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Leu Tyr Leu Ser Ala lie Arg Lys lie Pro Ala Leu Thr Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn lie Ala Met lie 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asp Gin Glu lie Ser Arg 
530 535 540 

Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro lie Leu Phe 
595 



<210> 11 
<211> 2283 
<212> DNA 
<213> Bos taurus 

<220> 
<221> CDS 

<222> (421) - . (2214) 
<400> 11 

cggcaggaag atggcggcct agcggaggtg tgagtggacc tgggtctctg cagctgggtt 60 
ttccctcttc ccgtctttct cctcttttcc tctcccccga ggttggcatc gagggggcca 120 
aattcgggcg gcggcgccgg gcgcagcgca ggggtcacaa cgacggcgac ggctgacggt 180 
tggaagggca ggcttccttc gcccctcgac ctccttcccc ggtccgcttg gtgtcaggcg 240 



26 



DOCID <WO 0050607A2_L> 



WO 00/50607 



PCT/IB00/00324 



cggcggcggc ggcggcggcg gcgcggcggg cggactccat ccctcctccc gctccctcct 300 

gcaccggagc gggcactcct tccttcgcca tcccccgacc cttcaccccg gggactgggc 360 

gcctccaccg gcgcagctca gggagcgggg gccggtctcc tgctcggctg tcgcgcctcc 420 

atg teg gat aac cag age tgg aac teg teg ggc teg gag gag gat ccg 468 
Met Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro 
1 5 10 15 

gag acg gag tec ggg ccg ccg gtg gag cgc tgc gga gtc etc aac aag 516 
Glu Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Asn Lys 
20 25 ' 30 

tgg aca aac tat att cat ggg tgg cag gat cgc tgg gta gtt ttg aaa 564 
Trp Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys 
35 40 45 

aat aac act ctg agt tac tac aaa tct gaa gat gag aca gag tat ggc 612 
Asn Asn Thr Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly 
50 5 5 60 

tgc aga gga tec ate tgt ctt age aag get gtc ate acg cct cat gat 660 
Cys Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp 
65 70 75 80 

ttt gat gaa tgc cga ttt gat att agt gta aat gat agt gtt tgg tat 708 
Phe Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr 
85 90 95 

ctt cgt get caa gat cca gat cac aga cag cag tgg ata gat gee att 756 
Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He 
100 105 110 

gaa cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt 804 
Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

cga cat ggc tec atg gta tea ttg gta tec gga gca agt ggc tat tct 852 
Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
13 0 13 5 140 

gca aca tec ace tec tea ttc aag aag ggc cac agt tta cgt gag aaa 900 
Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

ctg get gaa atg gaa acc ttt aga gat ata ctg tgt aga caa gtt gat 948 
Leu Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp 
165 170 ~ 175 

acc eta cag aag ttc ttt gat gee tgt get gat get gtc tec aag gat 996 
Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

gaa ttt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct 1044 
Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 ~ 205 

acg aca cgt tct gat gga gac ttc ttg cat aat acc aat ggc aat aag 1092 
Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 
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gaa aag gta ttt cca cat gta aca cca aaa gga act aat ggt ata gac 1140 
Glu Lys Val Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp 
225 230 235 240 

ttt aaa ggt gag gcg ata act ttt aaa gca act act gcc gga ate ctt 1188 
Phe Lys Giy Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu 
245 250 255 

g*ct aca ctt tct cat tgt att gag ctg atg gta aaa cgt gag gac age 1236 
Ala Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

tgg caa aag aga atg gac aag gaa act gag aag aga aga aga gtg gag 1284 
Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

gaa gca tac aaa aat gcc atg aca gaa ctt aag aaa aaa tec cac ttt 1332 
Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

gga gga cca gat tat gag gaa ggc cca aac agt ttg att aat gaa gag 1380 
Giv Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu 
305 310 315 320 

gag ttc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata 1428 
Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He 
325 330 335 

gaa gaa rag teg cag agt gaa aag gtc agg tta cat tgg tct act tea 1476 
Giu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

atg cca t:: gga gat gcc ttt tct tct gtg ggg act cat aga ttt gtc 1524 
Met Pre Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

caa aag get gaa gag atg gtg cag aac cac atg ace tat tea ttg cag 1572 
Gin Lys Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

gat gta ggt ggg gac gcc aac tgg cag ttg gtt gta gaa gaa ggg gag 1620 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

atg aag gta tat aga aga gaa gta gaa gaa aat ggg att gtt ctg gat 1668 
Met Lys Vdl Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val Leu Asp 
405 410 415 

cct ttg aaa get acc cat gca gtt aaa ggc gtt aca gga cac gag gtc 1716 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

tgc aat tac ttc tgg aat gtt gat gtt cgc aat gat tgg gaa aca act 1764 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1812 
lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala He He 
450 455 460 
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att tat caa acg cac aag aga gtg tgg cca gcc tct cag egg gat gtc 1860 
lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

tta tat ctg tct gcc att cga aag ata cca get ttg aat gaa aat gac 1908 
Leu Tyr Leu Ser Ala lie Arg Lys lie Pro Ala Leu Asn Glu Asn Asp 
485 490 495 

ccg gag act tgg ata gtt tgt aat ttt tct gta gat cac age agt get 1956 
Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Ser Ser Ala 
500 505 510 

cct-cta aac aat cga tgt gtc cgt gcc aaa ata aac gtt get atg att 2004 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn Val Ala Met lie 
515 520 525 

tgt cag acc ttg gtg age ccc cca gag gga aac cag gag att age agg 2052 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie Ser Arg 
530 535 540 

gac aac att eta tgc aag att aca tac gtg gcc aat gta aac cct gga 2100 
Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
S-iS 550 555 560 

aga tgg gcc cca gcc tea gtg tta egg gca gtg gca aag cga gaa tat 2148 
Zly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

cca aag ttt eta aag cgt ttt act tct tac gta caa gaa aaa act gca 2196 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

gga aaa cct att ttg ttc tagtattaac agtgactgaa gcaaggctgt 2244 
Gly Lys Pro He Leu Phe 
595 

gtgacattcc atgttggagg aaaaaaaaaa aaaaaaaaa 2283 



: ; : 5 9e 

<ZIZ> PPT 

-.213 > Bos taurus 



<A00> 12 

Met Ser Asp Asn Gin Ser Trp Asn 
1 5 

Glu Thr Glu Ser Gly Pro Pro Val 
20 

Trp Thr Asn Tyr He His Gly Trp 
35 40 

Asn Asn Thr Leu Ser Tyr Tyr Lys 
50 55 

Cys Arg Gly Ser He Cys Leu Ser 
65 ' 70 

Phe Asp Glu Cys Arg Phe Asp He 



Ser Ser Gly Ser Glu Glu Asp Pro 
10 15 

Glu Arg Cys Gly Val Leu Asn Lys 
25 * 30 

Gin Asp Arg Trp Val Val Leu Lys 
45 

Ser Glu Asp Glu Thr Glu Tyr Gly 
60 

Lys Ala Val He Thr Pro His Asp 

75 80 

Ser Val Asn Asp Ser Val Trp Tyr 
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85 



90 



95 



Leu Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie 
100 105 110 

Glu Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg 
115 120 125 

Arg His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser 
130 135 140 

Ala Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys 
145 150 155 160 

Leu Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp 
165 170 175 

Thr Leu Gin Lys Phe Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp 
180 185 190 

Glu Phe Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro 
195 200 205 

Thr Thr Arg Ser Asp Gly Asp Phe Leu His Asn Thr Asn Gly Asn Lys 
210 215 220 

Glu Lys Val Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp 
225 230 235 240 

Phe Lys Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu 
245 250 255 

Ala Thr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser 
260 265 270 

Trp Gin Lys Arg Met Asp Lys Glu Thr Glu Lys Arg Arg Arg Val Glu 
275 280 285 

Glu Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe 
290 295 300 

Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu 
305 310 315 320 

Glu Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie 
325 330 335 

Glu Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Ser Thr Ser 
340 345 350 

Met Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val 
355 360 365 

Gin Lys Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 400 

Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 



405 



410 



415 
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Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
420 425 430 

Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 480 

Leu Tyr Leu Ser Ala lie Arg Lys lie Pro Ala Leu Asn Glu Asn Asp 
485 490 495 

Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Ser Ser Ala 
500 505 510 

Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 560 

Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
565 570 575 

Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
580 585 590 

Gly Lys Pro He Leu Phe 
595 



<210> 13 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> (1) . . (78) 

<400> 13 

ccc tat agt cgc tct tec tec atg 
Pro Tyr Ser Arg Ser Ser Ser Met 
1 5 

tct gat gat gtt cac aga ttc age 
Ser Asp Asp Val His Arg Phe Ser 
20 



tct tec att gat eta gtc agt gec 48 
Ser Ser He Asp Leu Val Ser Ala 
10 15 

tec cag 78 
Ser Gin 
25 



<210> 14 

<211> 26 

<212> PRT 

<213> Homo sapiens 
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<400> 14 

Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala 
1 ^5 10 15 



Ser Asp Asp Val His Arg Phe Ser Ser Gin 
20 25 



<210> 15 
<211> 2034 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBPR3 

<220> 

<221> CDS 

<222> (10) . . (990) 

<400> 15 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 



teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 



15 



20 25 30 



99 



147 



acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 



aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 
Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 * 100 105 HO 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gcc att gaa 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
115 120 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 
His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 



243 



291 



339 



387 



435 



483 
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aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 
Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 
160 165 170 

get gaa atg gaa aca ttt aga gac ate tta tgt aga caa gtt gac acg 579 
Ala Glu Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr 
175 180 185 190 

eta cag aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa 627 
Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 
195 200 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 675 
Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 215 220 

acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 723 
Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 
Lys Leu Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe 
240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 
Lys Gly Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu Ala 
255 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 867 
Thr Leu Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 285 

cag aag aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa 915 
Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu 
290 295 300 

gca tat aaa aat gca atg aca gaa cga aaa aat ccc act ttg gag gac 963 
Ala Tyr Lys Asn Ala Met Thr Glu Arg Lys Asn Pro Thr Leu Glu Asp 
305 310 315 

cag att atg aag aag gee eta aca gtc tgattaatga agaagagttc 1010 
Gin lie Met Lys Lys Ala Leu Thr Val 
320 325 

tttgatgctg ttgaagctgc tcttgacaga caagataaaa tagaagaaca gtcacagagt 1070 

gaaaaggtga gattacattg gcctacatcc ttgccctctg gagatgeett ttcttctgtg 1130 

gggacacata gatttgtcca aaagecctat agtegctett cctccatgtc ttccattgat 1190 

ctagtcagtg cctctgatga tgttcacaga ttcagctccc aggttgaaga gatggtgcag 1250 

aaccacatga cttactcatt acaggatgta ggcggagatg ccaattggca gttggttgta 1310 

gaagaaggag aaatgaaggt atacagaaga gaagtagaag aaaatgggac tgttctggat 1370 

cctttaaaag ctacccatgc agttaaaggc gtcacaggac atgaagtctg caattatttc 1430 

tggaatgttg acgttcgcaa tgactgggaa acaactatag aaaactttca tgtggtggaa 1490 
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acattagctg ataatgcaat catcatttat caaacacaca agagggtgtg gcctgcttct 155 0 
cagcgagacg tattatatct ttctgtcatt cgaaagatac cagccttgac tgaaaatgac 1610 
cctgaaactt ggatagtttg taatttttct gtggatcatg acagtgctcc tctaaacaac 1670 
cgatgtgtcc gtgccaaaat aaatgttgct atgatttgtc aaaccttggt aagcccacca 1730 
gagggaaacc aggaaattag cagggacaac attctatgca agattacata tgtagctaat 1790 
gtgaaccctg gaggatgggc accagcctca gtgttaaggg cagtggcaaa gcgagagtat 1850 
cctaaatttc taaaacgttt tacttcttac gtccaagaaa aaactgcagg aaagcctatt 1910 
ttgttctagt attaacaggt actagaagat atgttttatc tttttttaac tttatttgac 1970 
taatatgact gtcaatacta aaatttagtt gttgaaagta tttactatgt tttttccgga 2030 
attc 2034 



<210> 16 
<211> 327 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBPR3 
<400> 16 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 * 70 75 80 

Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
145 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 



34 



;DOCID <WO 005060 7A2 J _> 



WO 00/50607 



PCT/IB00/00324 



Met Glu Thr Phe 
180 

Lys Tyr Phe Asp 
195 

Arg Asp Lys Val 
210 

Ser Asp Gly Asp 
225 

Phe Pro His Val 



Glu Ala He Thr 
260 

Ser His Cys He 
275 

Arg Leu Asp Lys 
290 

Lys Asn Ala Met 
305 

Met Lys Lys Ala 



165 

Arg Asp He Leu 



Ala Cys Ala Asp 
200 

Val Glu Asp Asp 
215 

Phe Leu His Ser 
230 

Thr Pro Lys Gly 
245 

Phe Lys Ala Thr 



Glu Leu Met Val 
280 

Glu Thr Glu Lys 
295 

Thr Glu Arg Lys 
310 

Leu Thr Val 

325 



170 

Cys Arg Gin Val 
185 

Ala Val Ser Lys 



Glu Asp Asp Phe 
220 

Thr Asn Gly Asn 
235 

He Asn Gly He 
250 

Thr Ala Gly He 
265 

Lys Arg Glu Asp 



Lys Arg Arg Thr 
300 

Asn Pro Thr Leu 
315 



175 

Asp Thr Leu Gin 
190 

Asp Glu Leu Gin 
205 

Pro Thr Thr Arg 



Lys Glu Lys Leu 
240 

Asp Phe Lys Gly 
255 

Leu Ala Thr Leu 
270 

Ser Trp Gin Lys 
285 

Glu Glu Ala Tyr 



Glu Asp Gin He 
320 



<210> 17 
<211> 1978 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG-GPBPDNLS 

<220> 

<221> CDS 

<222> (10) . . (1860) 

<400> 17 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
1 5 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 " 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 
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aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

a 9 a gga ccc atc t 9 t ctt a 9 c aa 9 9 ct 9 tc atc aca cct cac 9 at tct 291 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tar ctt 339 
Asp Glu Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 110 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gec att gaa 3 87 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala He Glu 
115 120 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 
His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 

aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 
Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 
160 165 170 

get gaa atg gaa aca ttt aga gac atc tta tgt aga caa gtt gac acg 579 
Ala Glu Met Glu Thr Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr 
175 180 185 190 

eta cag aag tac ttt gat gee tgt get gat get gtc tct aag gat gaa 627 
Leu Gin Lvs Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 
195 200 205 



ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 
Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 215 220 



435 



483 



aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 
Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe 
240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act: get gga atc ctt gca 
Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 
255 * 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 
Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 285 

cag aag aga ctg gat aag gaa act gag cac ttt gga gga cca gat tat 
Gin Lys Arg Leu Asp Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr 
290 295 300 



675 



acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 723 
Thr Arg Ser Asp Gly Asp Phe Leu His . Ser Thr Asn Gly Asn Lys Glu 
225 230 235 



771 



819 



867 



915 
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gaa gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt gat get 963 
Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala 
305 310 315 

get gaa get get ctt gac aga caa gat aaa ata gaa gaa cag tea cag 1011 
Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin 
220 325 330 

age gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat 1059 
Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp 
335 340 345 350 

gec ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc tat agt 1107 
Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser 
355 360 365 

cgc tct tec tec atg tct tec att gat eta gtc agt gec tct gat gat 1155 
Arg Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp 
370 375 380 

gtt cac aga ttc age tec cag gtt gaa gag atg gtg cag aac cac atg 1203 
Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met 
385 390 395 

act tar tea tta cag gat gta ggc gcja gat gee aat tgg cag ttg gtt 1251 
Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val 
400 405 410 

gta gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa gaa aat 1299 
Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn 
415 420 425 430 

ggg att gtt ctg gat cct tta aaa get acc cat gca gtt aaa ggc gtc 1347 
Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val 
435 440 445 

aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt cgc aat 1395 
Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn 
450 455 460 

gac tgg gaa aca act ata gaa aac ttt cat gtg gtg gaa aca tta get 1443 
Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala 
465 470 475 

gat aat gca ate ate att tat caa aca cac aag agg gtg tgg cct get 1491 
Asp Asn Ala He He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala 
48C 485 490 

tct caa rga gac gta tta tat ctt tct gtc att cga aag ata cca gee 1539 
Ser Glr. Ara Asp Val Leu Tyr Leu Ser Val He Arg Lys He Pro Ala 
495 500 505 510 

ttg act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt tct gtg 1587 
Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe Ser Val 
515 520 525 

gat cat qac agt get cct eta aac aac cga tgt gtc cgt gec aaa ata 1635 
Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He 
530 535 540 

aat gtt art atg att tgt caa acc ttg gta age cca cca gag gga aac 1683 
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Asn Val Ala Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn 
545 550 555 

cag gaa att age agg gac aac att eta tgc aag att aca tat gta get 1731 
Gin Glu lie Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala 
560 565 570 

aat gtg aac cct gga gga tgg gca cca gec tea gtg tta agg gca gtg 1779 
Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val 
575 580 585 590 

gca aag cga gag tat cct aaa ttt eta aaa cgt ttt act tct tac gtc 1827 
Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val 
595 600 605 

caa gaa aaa act gca gga aag cct att ttg ttc tagtattaac aggtactaga 1880 
Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
610 615 

agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt 1940 

agttgttgaa agtatttact atgttttttc eggaatte 1978 



<210> 18 
<211> 617 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG -GPBPDNLS 
<400> 18 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
1 5 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
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145 



150 



155 



160 



Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
165 170 175 

Met Glu Thr Phe Arg Asp lie Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 

Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 * 240 

Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly 
245 250 " 255 

Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu 
260 265 270 

Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 280 " 285 

Arg Leu Asp Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu 
290 295 300 

Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe Asp Ala Val Glu 
305 310 315 320 

Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu 
325 330 335 

Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe 
340 345 350 

Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser 
355 360 365 

Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His 
370 375 380 

Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr 
385 390 395 400 

Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu 
405 410 415 

Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie 
420 425 430 

Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly 
435 440 445 

His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp 
450 455 460 

Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn 
465 470 475 480 
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Ala He He He 



Arg Asp Val Leu 
500 

Glu Asn Asp Pro 
515 

Asp Ser Ala Pro 
530 

Ala Met He Cys 
545 

He Ser Arg Asp 



Asn Pro Gly Gly 
580 

Arg Glu Tyr Pro 
595 



Tyr Gin Thr His 
485 

Tyr Leu Ser Val 



Glu Thr Trp He 
52 0 

Leu Asn Asn Arg 
535 

Gin Thr Leu Val 
550 

Asn He Leu Cys 
565 

Trp Ala Pro Ala 



Lys Phe Leu Lys 
600 



Lys Arg Val Trp 
490 

He Arg Lys lie 
505 

Val Cys Asn Phe 



Cys Val Arg Ala 
540 

Ser Pro Pro Glu 
555 

Lys He Thr Tyr 
570 

Ser Val Leu Arg 
585 

Arg Phe Thr Ser 



Pro Ala Ser Gin 
495 

Pro Ala Leu Thr 
510 

Ser Val Asp His 
525 

Lys He Asn Val 



Gly Asn Gin Glu 
560 

Val Ala Asn Val 
575 

Ala Val Ala Lys 
590 

Tyr Val Gin Glu 
605 



Lvs Thr Ala Gly Lys Pro He Leu Phe 
610 615 



< 2 1 0 > 19 
<21!> 19^5 

<2i2> d:;a 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: FLAG-GPBPDSXY 



<220 > 

<222 > CO . . \ 1857) 



<400> 19 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 



teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 
Ser Asp Asr. Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 



aat get eta agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 

65 70 75 



99 



acg gaq tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Tnr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 



243 
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aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 339 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 110 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gee att gaa 3 87 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
115 120 125 

cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca 483 
His Gly Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr 
145 150 155 

ttt aga gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt 531 
Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe 
160 165 170 

gat gec tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa 579 
Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys 
ITS 180 185 190 

gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt 627 
Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly 
195 200 205 

gac ttc ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat 675 
Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His 
210 215 220 

gza aca cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata 723 
Va! Thr Pro Lys Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He 
225 230 235 

act ttt aaa gca act act get gga ate ctt gca aca ctt tct cat tgt 771 
Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys 
240 245 250 

att gaa eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat 819 
lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp 
255 260 265 270 

aag gaa act gag aag aaa aga aga aca gag gaa gca tat aaa aat gca 867 
Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala 
275 280 285 

atg aca gaa ctt aag aaa aaa tec cac ttt gga gga cca gat tat gaa 915 
Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro Asp Tyr Glu 
290 295 300 

gaa ggc cct aac agt ctg att aat gaa gaa gag ttc ttt gat get gtt 963 
Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val 
305 310 315 
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cac aga ttc age tec cag gtt gaa gag atg gtg cag aac cac atg act 
His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr 
385 390 395 



1059 



gaa get get ctt gac aga caa gat aaa ata gaa gaa cag tea cag agt 1011 
Glu Ala Ala Leu Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser 
320 325 330 

gaa aag gtg aga tta cat tgg cct aca tec ttg ccc tct gga gat gee 
Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala 
335 340 345 350 

ttt tct tct gtg ggg aca cat aga ttt gtc caa aag ccc tat agt cgc 1107 
Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg 
355 360 365 

tct tec tec atg tct tec att gat eta gtc agt gee tct gat gat gtt 1155 
Ser Ser Ser Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Val 
370 375 380 



1203 



cac tea tta cag gat gta ggc gga gat gec aat tgg cag ttg gtt gta 1251 
Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val 
400 405 410 

gaa gaa gga gaa atg aag gta tac aga aga gaa gta gaa gaa aat ggg 12 99 
Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly 
415 1 420 425 430 

att gtt ctg gat cct tta aaa get acc cat gca gtt aaa ggc gtc aca 
He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr 
435 440 445 

gga cat gaa gtc tgc aat tat ttc tgg aat gtt gac gtt cgc aat gac 
Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp 
450 455 460 

tgg gaa aca act ata gaa aac ttt cac gtg gtg gaa aca tta get gat 
Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp 
465 470 475 

aat gca ate ate att tat caa aca cac aag agg gtg tgg cct get tct 
Asn Ala He He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser 
480 485 490 

cag cga gac gta tta tat ctt tct gtc att cga aag ata cca gee ttg 
Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He Pro Ala Leu 
495 500 505 510 

act gaa aat gac cct gaa act tgg ata gtt tgt aat ttt tct gtg gat 1587 
Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp 
515 * 520 525 

cat gac agt get cct eta aac aac cga tgt gtc cgt gee aaa ata aat 1635 
His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn 
530 535 540 



1347 



1395 



1443 



1491 



1539 



gtt get atg att tgt caa acc ttg gta age cca cca gag gga aac cag 
Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin 
545 550 555 

gaa att age agg gac aac att eta tgc aag att aca tat gta get aat 



1683 



1731 
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Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr Val Ala Asn 
560 565 570 

gtg aac cct gga gga tgg gca cca gcc tea gtg eta agg gca gtg gca 1779 
Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala 
575 580 585 ~ 590 

aag cga gag tat cct aaa ttt eta aaa cgt ttt act tct tac gtc caa 1827 
Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin 
595 600 605 

gaa aaa act gca gga aag cct att ttg ttc tagtattaac aggtactaga 1877 
Glu Lys Thr Ala Gly Lys Pro He Leu Phe 
610 615 

agatatgttt tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt 1937 
agttgttgaa agtatttact atgttttttc eggaatte 1975 



<210> 20 
<211> 616 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: FLAG-GPBPDSXY 



<400> 20 

Met Ala Pro Leu Ala Asp Tyr Lys 
1 5 

Asn Gin Ser Trp Asn Ser Ser Gly 
20 

Ser Gly Pro Pro Val Glu Arg Cys 
35 40 

Tyr He His Gly Trp Gin Asp Arg 
50 55 



Asp Asp Asp Asp Lys Met Ser Asp 
10 15 

Ser Glu Glu Asp Pro Glu Thr Glu 
25 30 

Gly Val Leu Ser Lys Trp Thr Asn 
45 

Trp Val Val Leu Lys Asn Asn Ala 
60 



Leu Ser Tyr Tyr 
65 

Ser He Cys Leu 



Cys Arg Phe Asp 
100 

Gin Asp Pro Asp 
115 

Lys Thr Glu Ser 
130 

Lys Gly His Ser 
145 

Asp He Leu Cys 



Lys Ser Glu Asp 
70 

Ser Lys Ala Val 
85 

He Ser Val Asn 



His Arg Gin Gin 
120 

Gly Tyr Gly Ser 
135 

Leu Arg Glu Lys 
150 

Arg Gin Val Asp 
165 



Glu Thr Glu Tyr 
75 

He Thr Pro His 
90 

Asp Ser Val Trp 
105 

Trp He Asp Ala 



Glu Ser Ser Leu 
140 

Leu Ala Glu Met 
155 

Thr Leu Gin Lys 
170 



Gly Cys Arg Gly 
80 

Asp Phe Asp Glu 
95 

Tyr Leu Arg Ala 
110 

He Glu Gin His 
125 

Arg Arg His Gly 



Glu Thr Phe Arg 
160 

Tyr Phe Asp Ala 
175 
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Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
180 185 190 

Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
195 200 205 

Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His Val Thr 
210 215 220 

Pro Lys Gly He Asn Gly He Asp Phe Lys Gly Glu Ala He Thr Phe 
225 230 235 240 

Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys He Glu 
245 250 255 

Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp Lys Glu 
260 265 270 

Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr Lys Asn Ala Met Thr 
275 280 285 

Glu Leu Lys Lvs Lys Ser His Phe Gly Gly Pro Asp Tyr Glu Glu Gly 
290 295 300 

P^o Asn Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala 
305 310 315 320 

A. a Leu Asp Arg Gin Asp Lys lie Glu Glu Gin Ser Gin Ser Glu Lys 
325 330 335 

Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser 
340 345 350 

3er Val Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser 
355 360 365 

3er Met Ser Ser He Asp Leu Val Ser Ala Ser Asp Asp Val His Arg 

375 380 

Ph<- :Jer Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser 
355 " 390 395 400 

Leu G.r. Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu 
405 410 415 

Giv Glu Met Lvs Val Tyr Arg Arg Glu Val Glu Glu Asn Gly He Val 
420 425 430 

Leu Asd Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His 
435 440 445 

Glu Val Cvs Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu 
450 * 455 460 

^hr Thr He Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala 
465 ~ 470 475 480 

He He He Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg 
485 490 495 
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Asp Val Leu Tyr Leu Ser Val lie Arg Lys lie Pro Ala Leu Thr Glu 
500 505 510 

Asn Asp Pro Glu Thr Trp lie Val Cys Asn Phe Ser Val Asp His Asp 
515 520 525 

Ser Ala Pro Leu Asn Asn Arg Cys Val Arg Ala Lys lie Asn Val Ala 
530 535 540 

Met lie Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu lie 
545 550 555 560 

Ser Arg Asp Asn lie Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn 
565 570 575 

Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg 
580 585 590 

Glu Tyr Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys 
595 600 605 

Thr Ala Gly Lys Pro lie Leu Phe 
610 615 



<210> 21 
<211> 1915 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
FLAG-GPBPDSXY/NLS 

<220> 

<221> CDS 

<222> (10) . . (1797) 

<400> 21 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
1 5 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 
15 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 

aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 195 
Thr Asn Tyr lie His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 

aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 

aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 291 
Arg Gly Ser lie Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe 
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80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 

Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 

95 100 105 110 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gec att gaa 

Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 

115 12 0 12 5 



gtg gta gaa gat gat gaa gat gac ttt cct aca acg cgt tct gat ggt 
Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly 
195 200 205 



339 



387 



cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc aaa ggc cac agt tta cgt gag aag ttg get gaa atg gaa aca 4 83 
His Gly Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr 
145 150 155 

ttt aga gac ate tta tgt aga caa gtt gac acg eta cag aag tac ttt 531 
Phe Arg Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe 
160 " 165 170 

gat gec tgt get gat get gtc tct aag gat gaa ctt caa agg gat aaa 579 
Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys 
175 180 185 190 



gtg aca cca aaa gga att aat ggt ata gac ttt aaa ggg gaa gcg ata 
Val Thr Pro Lys Gly He Asn Gly lie Asp Phe Lys Gly Glu Ala He 
225 230 235 

act ttt aaa gca act act get gga ate ctt gca aca ctt tct cat tgt 
Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala Thr Leu Ser His Cys 
240 245 250 

att gaa eta atg gtt aaa cgt gag gac age tgg cag aag aga ctg gat 
lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys Arg Leu Asp 
255 260 265 270 

aag gaa act gag cac ttt gga gga cca gat tat gaa gaa ggc cct aac 
Lys Glu Thr Glu His Phe Gly Gly Pro Asp Tyr Glu Glu Gly Pro Asn 
275 280 285 



627 



gac ttc ttg cat agt ace aac ggc aat aaa gaa aag tta ttt cca cat 675 
Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu Phe Pro His 
210 215 220 



723 



771 



819 



867 



agt ctg att aat gaa gaa gag ttc ttt gat get gtt gaa get get ctt 915 
Ser Leu He Asn Glu Glu Glu Phe Phe Asp Ala Val Glu Ala Ala Leu 
290 295 300 

gac aga caa gat aaa ata gaa gaa cag tea cag agt gaa aag gtg aga 963 
Asp Arg Gin Asp Lys He Glu Glu Gin Ser Gin Ser Glu Lys Val Arg 
305 310 315 

tta cat tgg cct aca tec ttg ccc tct gga gat gee ttt tct tct gtg 1011 
Leu His Trp Pro Thr Ser Leu Pro Ser Gly Asp Ala Phe Ser Ser Val 
320 325 330 
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ggg aca cat aga ttt gtc caa aag ccc tat agt cgc tct tec tec atg 1059 
Gly Thr His Arg Phe Val Gin Lys Pro Tyr Ser Arg Ser Ser Ser Met 
335 340 345 350 

tct tec att gat eta gtc agt gee tct gat gat gtt cac aga ttc age 1107 
Ser Ser lie Asp Leu Val Ser Ala Ser Asp Asp Val His Arg Phe Ser 
355 360 ~ 365 

tec cag gtt gaa gag atg gtg cag aac cac atg act tac tea tta cag 1155 
Ser Gin Val Glu Glu Met Val Gin Asn His Met Thr Tyr Ser Leu Gin 
370 375 380 

gat gta ggc gga gat gec aat tgg cag ttg gtt gta gaa gaa gga gaa 1203 
Asp Val Gly Gly Asp Ala Asn Trp Gin Leu Val Val Glu Glu Gly Glu 
385 390 395 

atg aag gta tac aga aga gaa gta gaa gaa aat ggg att gtt ctg gat 1251 
Met Lys Val Tyr Arg Arg Glu Val Glu Glu Asn Gly lie Val Leu Asp 
4 00 4 05 410 

cct tta aaa get acc cat gca gtt aaa ggc gtc aca gga cat gaa gtc 12 99 
Pro Leu Lys Ala Thr His Ala Val Lys Gly Val Thr Gly His Glu Val 
415 420 425 430 

tgc aat tat ttc tgg aat gtt gac gtt cgc aat gac tgg gaa aca act 1347 
Cys Asn Tyr Phe Trp Asn Val Asp Val Arg Asn Asp Trp Glu Thr Thr 
435 440 445 

ata gaa aac ttt cat gtg gtg gaa aca tta get gat aat gca ate ate 1395 
lie Glu Asn Phe His Val Val Glu Thr Leu Ala Asp Asn Ala lie lie 
450 455 460 

att tat caa aca cac aag agg gtg tgg cct get tct cag cga gac gta 1443 
lie Tyr Gin Thr His Lys Arg Val Trp Pro Ala Ser Gin Arg Asp Val 
465 470 475 

tta tat ctt tct gtc att cga aag ata cca gee ttg act gaa aat gac 1491 
Leu Tyr Leu Ser Val He Arg Lys He Pro Ala Leu Thr Glu Asn Asp 
480 485 490 

cct gaa act tgg ata gtt tgt aat ttt tct gtg gat cat gac agt get 1539 
Pro Glu Thr Trp He Val Cys Asn Phe Ser Val Asp His Asp Ser Ala 
495 500 505 510 

cct eta aac aac cga tgt gtc cgt gee aaa ata aat gtt get atg att 1587 
Pro Leu Asn Asn Arg Cys Val Arg Ala Lys He Asn Val Ala Met He 
515 520 525 

tgt caa acc ttg gta age cca cca gag gga aac cag gaa att age agg 1635 
Cys Gin Thr Leu Val Ser Pro Pro Glu Gly Asn Gin Glu He Ser Arg 
530 535 540 

gac aac att eta tgc aag att aca tat gta get aat gtg aac cct gga 1683 
Asp Asn He Leu Cys Lys lie Thr Tyr Val Ala Asn Val Asn Pro Gly 
545 550 555 

99^ tgg gca cca gec tea gtg tta agg gca gtg gca aag cga gag tat 1731 
Gly Trp Ala Pro Ala Ser Val Leu Arg Ala Val Ala Lys Arg Glu Tyr 
560 565 570 
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cct aaa ttt eta aaa cgt ttt act tct tac gtc caa gaa aaa act gca 1779 
Pro Lys Phe Leu Lys Arg Phe Thr Ser Tyr Val Gin Glu Lys Thr Ala 
575 580 585 590 

gga aag cct att ttg ttc tagtattaac aggtactaga agatatgttt 1827 
Gly Lys Pro He Leu Phe 
5 95 

tatctttttt taactttatt tgactaatat gactgtcaat actaaaattt agttgttgaa 1887 
agtatttact atgttttttc eggaatte 1915 



<210> 22 
<211> 596 
<212> ?RT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
FLAG-GPBPDSXY/NLS 

<400> 22 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
15 10 15 

Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
3 5 40 4 5 

Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
50 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
65 70 75 80 

Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu Met Glu Thr Phe Arg 
145 150 155 160 

Asp He Leu Cys Arg Gin Val Asp Thr Leu Gin Lys Tyr Phe Asp Ala 
165 170 175 

Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin Arg Asp Lys Val Val 
180 185 190 

Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg Ser Asp Gly Asp Phe 
195 200 205 
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Leu His Ser Thr 
210 

Pro Lys Giy lie 



Lye Aia Thr Thr 



Leu Met Val Lys 
260 

Thr Glu His Phe 
275 

He Asn Glu Glu 
290 

Gin Asp Lys He 
305 

Trp Pro Thr Ser 



His Arg Phe Val 
340 

He Asp Leu Val 

:ss 

Val Glu Ciu Met 
37C 

Gly Gly Asp Ala 
385 

Val Tyr Arg Arg 



Lys Aia Tr.r His 
420 

Tyr Phe Trp Asn 
435 

Asn Phe His Val 
45C 

Gin Thr H l c Lys 
465 

Leu Ser Val He 



Thr Trp He Val 
500 

Asn Asr. Arg Cys 
SIS 



Asn Gly Asn Lys 
215 

Asn Gly He Asp 
230 

Ala Gly He Leu 
245 

Arg Glu Asp Ser 



Gly Gly Pro Asp 
280 

Glu Phe Phe Asp 
2 95 

Glu Glu Gin Ser 
310 

Leu Pro Ser Gly 
325 

Gin Lys Pro Tyr 



Ser Ala Ser Asp 
360 

Val Gin Asn His 
375 

Asn Trp Gin Leu 
390 

Glu Val Glu Glu 
405 

Ala Val Lys Gly 



Val Asp Val Arg 
440 

Val Glu Thr Leu 
455 

Arg Val Trp Pro 

470 

Arg Lys He Pro 
485 

Cys Asn Phe Ser 



Val Arg Ala Lys 
520 



Glu Lys Leu Phe 
220 

Phe Lys Gly Glu 
235 

Ala Thr Leu Ser 
250 

Trp Gin Lys Arg 
265 

Tyr Glu Glu Gly 



Ala Val Glu Ala 
300 

Gin Ser Glu Lys 
315 

Asp Ala Phe Ser 
330 

Ser Arg Ser Ser 
345 

Asp Val His Arg 



Met Thr Tyr Ser 
380 

Val Val Glu Glu 
395 

Asn Gly He Val 
410 

Val Thr Gly His 
425 

Asn Asp Trp Glu 



Ala Asp Asn Ala 
460 

Ala Ser Gin Arg 
475 

Ala Leu Thr Glu 
490 

Val Asp His Asp 
505 

He Asn Val Ala 



Pro His Val Thr 



Ala He Thr Phe 
240 

His Cys He Glu 
255 

Leu Asp Lys Glu 
270 

Pro Asn Ser Leu 
285 

Ala Leu Asp Arg 



Val Arg Leu His 
320 

Ser Val Gly Thr 
335 

Ser Met Ser Ser 
350 

Phe Ser Ser Gin 
365 

Leu Gin Asp Val 



Gly Glu Met Lys 
400 

Leu Asp Pro Leu 
415 

Glu Val Cys Asn 
430 

Thr Thr He Glu 
445 

He lie He Tyr 



Asp Val Leu Tyr 
480 

Asn Asp Pro Glu 
495 

Ser Ala Pro Leu 
510 

Met He Cys Gin 
525 
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Thr Leu Val Ser 
530 

He Leu Cys Lys 
545 

Ala Pro Ala Ser 



Phe Leu Lys Arg 
580 

Pro He Leu Phe 
595 



Pro Pro Glu Gly 
535 

He Thr Tyr Val 
550 

Val Leu Arg Ala 
565 

Phe Thr Ser Tyr 



Asn Gin Glu He 
540 

Ala Asn Val Asn 
555 

Val Ala Lys Arg 
570 

Val Gin Glu Lys 
585 



Ser Arg Asp Asn 



Pro Gly Gly Trp 
560 

Glu Tyr Pro Lys 
575 

Thr Ala Gly Lys 
590 



<210> 23 
<211> 2038 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPBP-D169A 

<220> 
<221> CDS 

<222> (10) . . (1920) 
<400> 23 

gaattcacc atg gcc cca eta gec gac tac aag gac gac gat gac aag atg 51 
Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met 
15 10 

teg gat aat cag age tgg aac teg teg ggc teg gag gag gat cca gag 99 
Ser Asp Asn Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu 

15 * 20 25 30 

acg gag tct ggg ccg cct gtg gag cgc tgc ggg gtc etc agt aag tgg 147 
Thr Glu Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp 
35 40 45 



aca aac tac att cat ggg tgg cag gat cgt tgg gta gtt ttg aaa aat 
Thr Asn Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn 
50 55 60 



aga gga tec ate tgt ctt age aag get gtc ate aca cct cac gat ttt 
Arg Gly Ser He Cys Leu Ser Lys Ala Val He Thr Pro His Asp Phe 
80 85 90 

gat gaa tgt cga ttt gat att agt gta aat gat agt gtt tgg tat ctt 
Asp Glu Cys Arg Phe Asp He Ser Val Asn Asp Ser Val Trp Tyr Leu 
95 100 105 HO 

cgt get cag gat cca gat cat aga cag caa tgg ata gat gcc att gaa 
Arg Ala Gin Asp Pro Asp His Arg Gin Gin Trp He Asp Ala He Glu 
115 120 125 



195 



aat get ctg agt tac tac aaa tct gaa gat gaa aca gag tat ggc tgc 243 
Asn Ala Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys 
65 70 75 



291 



339 



387 
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cag cac aag act gaa tct gga tat gga tct gaa tec age ttg cgt cga 435 
Gin His Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg 
130 135 140 

cat ggc tea atg gtg tec ctg gtg tct gga gca agt ggc tac tct gca 483 
His Gly Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala 
145 150 155 

aca tec acc tct tea ttc aag aaa ggc cac agt tta cgt gag aag ttg 531 
Thr Ser Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu 
160 165 170 

get gaa atg gaa aca ttt aga gee ate tta tgt aga caa gtt gac acg 579 
Ala Glu Met Glu Thr Phe Arg Ala lie Leu Cys Arg Gin Val Asp Thr 
175 180 185 190 

eta cag aag tac ttt gat gec tgt get gat get gtc tct aag gat gaa 627 
Leu Gin Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu 
195 200 205 

ctt caa agg gat aaa gtg gta gaa gat gat gaa gat gac ttt cct aca 675 
Leu Gin Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr 
210 215 * 220 

acg cgt tct gat ggt gac ttc ttg cat agt acc aac ggc aat aaa gaa 72 3 
Thr Arg Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu 
225 230 235 

aag tta ttt cca cat gtg aca cca aaa gga att aat ggt ata gac ttt 771 
Lys Leu Phe Pro His Val Thr Pro Lys Gly He Asn Gly He Asp Phe 
240 245 250 

aaa ggg gaa gcg ata act ttt aaa gca act act get gga ate ctt gca 819 
Lys Gly Glu Ala He Thr Phe Lys Ala Thr Thr Ala Gly He Leu Ala 
255 260 265 270 

aca ctt tct cat tgt att gaa eta atg gtt aaa cgt gag gac age tgg 867 
Thr Leu Ser His Cys He Glu Leu Met Val Lys Arg Glu Asp Ser Trp 
275 280 285 

cag aag aga ctg gat aag gaa act gag aag aaa aga aga aca gag gaa 915 
Gin Lys Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu 
290 295 300 

gca tat aaa aat gca atg aca gaa ctt aag aaa aaa tec cac ttt gga 963 
Ala Tyr Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly 
305 310 315 

gga cca gat tat gaa gaa ggc cct aac agt ctg att aat gaa gaa gag 1011 
Gly Pro Asp Tyr Glu Glu Gly Pro Asn Ser Leu He Asn Glu Glu Glu 
320 325 330 

ttc ttt gat get gtt gaa get get ctt gac aga caa gat aaa ata gaa 1059 
Phe Phe Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys He Glu 
335 340 345 350 

gaa cag tea cag agt gaa aag gtg aga tta cat tgg cct aca tec ttg 1107 
Glu Gin Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu 
355 360 365 

ccc tct gga gat gee ttt tct tct gtg ggg aca cat aga ttt gtc caa 1155 
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Pro Ser Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin 
370 375 380 



aag ccc cat agt cgc tct tec tec atg tct tec att gat eta gtc agt 
Lvs Pro Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser 
385 390 395 



1203 



gec tec gat gat gtt cac aga ttc age tec cag gtt gaa gag atg gtg 1251 
Aia Ser Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val 
400 405 410 

cag aac cac atg act tac tea tta cag gat gta ggc gga gat gec aat 1299 
Gin Asn His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn 
4 15 420 425 430 

tag cag ttg gtt gta gaa gaa gga gaa atg aag gta tac aga aga gaa 
Trp Gin Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu 
435 440 445 

gta gaa gaa aat ggg att gtt ctg gat cct tta aaa get acc cat gca 
Val Glu Glu Asn Gly lie Val Leu Asp Pro Leu Lys Ala Thr His Ala 
450 4S5 460 

gtt aaa age gtc aca gga cat gaa gtc tgc aat tat ttc tgg aat gtt 
Val Lvs Giv Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val 
465 470 475 

gac gtt cac aat gac tgg gaa aca act ata gaa aac ttt cat gtg gtg 
Asp Val Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val 
4 80 4 85 490 

gaa aca tta get gat aat gca ate ate att tat caa aca cac aag agg 
Glu Tr.r Leu Ala Asp Asn Ala He He He Tyr Gin Thr His Lys Arg 
495 500 505 510 

gtg tgg cct get tct cag cga gac gta tta tat ctt tct gtc att cga 
Val Trp Pro Ala Ser Gin hrg Asp Val Leu Tyr Leu Ser Val He Arg 
515 520 525 

aag ata cca gec ttg act gaa aat gac cct gaa act tgg ata gtt tgt 
Lys He Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys 
530 535 540 

aat ttt tct gtg gat cat gac agt get cct eta aac aac cga tgt gtc 
Asn Phe Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cys Val 
545 550 555 

cgt gec aaa ata aat gtt get atg att tgt caa acc ttg gta age cca 1731 
Arg Ala Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro 
560 565 570 



1347 



1395 



1443 



1491 



1539 



1587 



1635 



1683 



cca gaa gga aac cag gaa att age agg gac aac att eta tgc aag att 
Pro Glu Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He 
575 ' 580 585 590 

aca tat gta get aat gtg aac cct gga gga tgg gca cca gee tea gtg 
Thr Tvr val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val 
595 600 605 

tta agg gca atg gca aag cga gag tat cct aaa ttt eta aaa cgt ttt 
Leu Arg A. a Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lys Arg Phe 
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610 615 620 

act tct tac gtc caa gaa aaa act gca gga aag cct att ttg ttc 1920 
Thr Ser Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
625 630 635 

tagtattaac aggtactaga agatatgttt tatctttttt taactttatt tgactaatat 1980 

gactgtcaat actaaaattt agttgttgaa agtatttact atgttttttc cggaattc 2038 

<210> 24 
<211> 637 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; GPBP-D169A 
<400> 24 

Met Ala Pro Leu Ala Asp Tyr Lys Asp Asp Asp Asp Lys Met Ser Asp 
1 5 10 15 

Asr. Gin Ser Trp Asn Ser Ser Gly Ser Glu Glu Asp Pro Glu Thr Glu 
20 25 *" 30 

Ser Gly Pro Pro Val Glu Arg Cys Gly Val Leu Ser Lys Trp Thr Asn 
35 40 45 

Tyr He His Gly Trp Gin Asp Arg Trp Val Val Leu Lys Asn Asn Ala 
SO 55 60 

Leu Ser Tyr Tyr Lys Ser Glu Asp Glu Thr Glu Tyr Gly Cys Arg Gly 
6S 70 75 80 

Ser He Cys Leu Ser Lys Ala Val lie Thr Pro His Asp Phe Asp Glu 
85 90 95 

Cys Arg Phe Asp lie Ser Val Asn Asp Ser Val Trp Tyr Leu Arg Ala 
100 105 110 

Gin Asp Pro Asp His Arg Gin Gin Trp lie Asp Ala lie Glu Gin His 
115 120 125 

Lys Thr Glu Ser Gly Tyr Gly Ser Glu Ser Ser Leu Arg Arg His Gly 
130 135 140 

Ser Met Val Ser Leu Val Ser Gly Ala Ser Gly Tyr Ser Ala Thr Ser 
1^5 150 155 160 

Thr Ser Ser Phe Lys Lys Gly His Ser Leu Arg Glu Lys Leu Ala Glu 
165 170 ~ 175 

Met Glu Thr Phe Arg Ala lie Leu Cys Arg Gin Val Asp Thr Leu Gin 
180 185 190 

Lys Tyr Phe Asp Ala Cys Ala Asp Ala Val Ser Lys Asp Glu Leu Gin 
195 200 205 

Arg Asp Lys Val Val Glu Asp Asp Glu Asp Asp Phe Pro Thr Thr Arg 
210 215 220 
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Ser Asp Gly Asp Phe Leu His Ser Thr Asn Gly Asn Lys Glu Lys Leu 
225 230 235 240 

Phe Pro His Val Thr Pro Lys Gly lie Asn Gly lie Asp Phe Lys Gly 
245 250 255 

Glu Ala lie Thr Phe Lys Ala Thr Thr Ala Gly lie Leu Ala Thr Leu 
260 265 270 

Ser His Cys lie Glu Leu Met Val Lys Arg Glu Asp Ser Trp Gin Lys 
275 280 285 

Arg Leu Asp Lys Glu Thr Glu Lys Lys Arg Arg Thr Glu Glu Ala Tyr 
290 295 300 

Lys Asn Ala Met Thr Glu Leu Lys Lys Lys Ser His Phe Gly Gly Pro 
305 310 315 320 

Asp Tyr Glu Glu Gly Pro Asn Ser Leu lie Asn Glu Glu Glu Phe Phe 
325 330 335 

Asp Ala Val Glu Ala Ala Leu Asp Arg Gin Asp Lys lie Glu Glu Gin 
340 345 , 350 

Ser Gin Ser Glu Lys Val Arg Leu His Trp Pro Thr Ser Leu Pro Ser 
355 360 365 

Gly Asp Ala Phe Ser Ser Val Gly Thr His Arg Phe Val Gin Lys Pro 
370 375 380 

Tyr Ser Arg Ser Ser Ser Met Ser Ser lie Asp Leu Val Ser Ala Ser 
385 390 395 400 

Asp Asp Val His Arg Phe Ser Ser Gin Val Glu Glu Met Val Gin Asn 
405 410 415 

His Met Thr Tyr Ser Leu Gin Asp Val Gly Gly Asp Ala Asn Trp Gin 
420 425 430 

Leu Val Val Glu Glu Gly Glu Met Lys Val Tyr Arg Arg Glu Val Glu 
435 440 445 

Glu Asn Gly He Val Leu Asp Pro Leu Lys Ala Thr His Ala Val Lys 
450 455 460 

Gly Val Thr Gly His Glu Val Cys Asn Tyr Phe Trp Asn Val Asp Val 
465 470 475 480 

Arg Asn Asp Trp Glu Thr Thr He Glu Asn Phe His Val Val Glu Thr 
485 490 495 

Leu Ala Asp Asn Ala He He lie Tyr Gin Thr His Lys Arg Val Trp 
500 505 510 

Pro Ala Ser Gin Arg Asp Val Leu Tyr Leu Ser Val He Arg Lys He 
515 520 525 

Pro Ala Leu Thr Glu Asn Asp Pro Glu Thr Trp He Val Cys Asn Phe 
530 535 540 
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Ser Val Asp His Asp Ser Ala Pro Leu Asn Asn Arg Cvs Val Arg Ala 
545 550 555 " * 560 

Lys He Asn Val Ala Met He Cys Gin Thr Leu Val Ser Pro Pro Glu 
565 570 575 

Gly Asn Gin Glu He Ser Arg Asp Asn He Leu Cys Lys He Thr Tyr 
580 585 590 

Val Ala Asn Val Asn Pro Gly Gly Trp Ala Pro Ala Ser Val Leu Arg 
595 600 605 

Ala Val Ala Lys Arg Glu Tyr Pro Lys Phe Leu Lvs Arg Phe Thr Ser 
610 615 620 

Tyr Val Gin Glu Lys Thr Ala Gly Lys Pro lie Leu Phe 
625 630 * 635 

<210> 25 

<211> 12482 

<212> DNA 

<213> Homo sapiens 

<400> 25 

tcgaccactt ccctcttcat attcagtgta tattgcacag atctctcaac aacacagcca 60 
ttaaatagat attctccaag tgacacttac atcacacatg cttgagttta cgttacttgc 120 
aaacataggg aaagaaagat acatgggata aactggtgca tgagaaatga gatcttagca 180 
gttggttgaa ataaatgaga acaactgagg caaactaaag aggaagaagg gcaagtggca 24 0 
gcttaacagg agtaagatga tgagatgaag ggcagaatac cttcatggag aggaggcaaa 3 00 
gagatataca tgatatgttc ttaggaacat aactgaagca aacaatgata ttatttctaa 360 
ttatatataa acctgtgagt cagccttcca ggggcggcct gccaaggtag aatcattgga 420 
atgatttggc cagggtttgg ataggagaga attggcagca gcgttaagat tgacccatga 480 
caaataatgc tatgcaggta gcagggagtc tgactaggag caaaatcaac gaacttatcc 540 
cttgcctaac atagcatctg tggagtcaga aagaagaggt taaattggga tatctgaggc 600 
aagtatcagg atttgccatg tctgcggagt agtttcataa ttctaatggt tataagcact 660 
aaggcgttca ctaagtgaac gttggtagtt ccaggttata ttatccattc ttgagttaca 720 
aaatacactt taaaaccttc ccatcttaat attatatgtt tttttagtca cagagtgaaa 780 
aggtgagatt acattggcct acatccttgc cctctggaga tgccttttct tctgtgggga 840 
cacatagatt tgtccaaaag gtaagctaat gtcagagttt actaaaagta caccttgtat 900 
tgttcttcat tgttggtgga aatatctttt atttgagacg gagtctcact ctgtcaccag 960 
agtggagtgc agtggcgcga tctcggctca ctacagtctc cacctcccgg gttcaagaga 1020 
ttctcgtgcc tcagcctccc tggtagctgg gattacaggc atgtaccacc acacccagct 1080 
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aatttttgta tttttaatgg agacagccrc accatggcca ggatggtctt gatctcctga 1140 
ccttgtgatc cacccacctc agcctcccag agtgctggga ttacaggcgt gagccaccat 1200 
gcccagccgg aaatatcttg tagtatataa gttttctccc cttttcatta atttaagtaa 1260 
tgagactgtt tttggtttta tatattgtat tccatataca tcctccaaaa cagttagaaa 1320 
ttttgttctg aaaataaagt tctttcattt ttatttaagg ggaaagttgg gggtgggcaa 13 80 
ataaggagtg gctagtccaa aatagttaac cagaagtata tccagttata ctaaatctct 1440 
ctcttctttg gggttaaatg gtattacttt gtattattgg aagcactaca ttcttttttg 1500 
gaatgatttt ggaacataat acataatagg tgcatgaagt cagcagttgc tgctgtgctt 1560 
gtttcatata gtgctttgtt ttctcttccc tttatcttgt gtttggaagt tggtactgaa 1620 
tgccctgttg tgcctttgtt ctgattactt ggttttttct ttgtctgtct ctggtagccc 1680 
cacagtcgct cttcctccat gtcttccatt gatctagtca gtgcctctga tgatgttcac 1740 
agactcagct cccaggtact gtatgaatgt atagagtgga cttgagtctt tctgtgctat 1800 
attccagcct gctttcccag ttcctagaaa tcttttggtt aggccactga ttttagtttt 1860 
gaattttaaa tagtaacatt aagcattaaa aaggtcttcc ttgtctacta aatagttcct 1920 
cngtcaggct tgcacgtgtc ctttactatt cacagcttgg aattttgtca tataggaggt 1980 
actccaqaaa gattttcaaa ctgaattgaa acaaatagaa gatactgggt tttgtatatc 2040 
atgtaacatc tgtttcttca gtcaggattt agcagttttg atggacgtgg tccatatgat 2100 
atgtcatagc agaaaagcag atttttacaa gtctcacttt aaagcctaaa gtacccccaa 2160 
ttaatattca acaaggaaat cactttttaa taatatgttt catttccatt ataatactaa 2220 
ac:c:atcga gcagattgtg ttttccttat gcaaattacc tttggatatt ataaatgaat 2280 
attrctgtcc atatgctaaa tctatggaaa tttgttttaa tttttagcat tggtaagggc 2340 
ttaggaaccc aagacaggaa gctggatgct tgcggtctct aaagtctgta ccctcaaaat 2400 
aaaaccagat taccattgga agaagttttt tttagtgtca gcgttagttc tttttttaat 2460 
tctcttaatc ttcacatctt tgccattcaa ctttttatct ttctggtgat tgcattttat 2520 
cggactagat tatattatgt taatcttata ttaaagacct gagcactctg gtcagaatga 2580 
ctcagtttaa accctggtta ggtgtatgat cccagtaagt tttctaactt ttttgtgctt 2640 
catttttatg atttagctag aacctgacac ataataagtg ctcaataaat gttaccttgt 2700 
attgctatta taacataatt tctttgagct aataaaagtt atctacatca ttattttttc 2760 
ctccgtgaga gtattgctat aaaagttttt aaaagtcata gtttaaagag atttctatta 2820 
tttttatgtt tataaataaa gtttacatta gtttttaacc tgcaatagag aagaatatta 2880 
agactttaat ttttctgact tgtacagcgt ttttctcctt gaatactctt aagaaaaaga 2940 
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tttagcaact ctggatcaga aatcatccat 
ttgcttgtcc atttatgcat ttttttttaa 
tgctctgttg cccaggctgg agtgcagtgg 
cccaggttca agcaattctc ccgcctcagc 
ccactatgcc cagctaattt ttgtattctt 
ggctggtcta gcactcctga cctcgtgatc 
ttacaggtgt gagccaccat gcccggccct 
ataatgaatc tgtgacaaaa ctacataata 
ccacattgca ttaaaatcat atcacagact 
aatgaaagaa aatataatga cagcaaagaa 
agaattaggg gttcagaacc caaaccaggg 
ttgcttttaa gtccaggtaa ctatatcact 
tttgctgaca gcttacttct tcgtaacatt 
cttgagagaa atcttgctgg atttttttct 
cctgttttag cctttgatta tgtactgttt 
ataacattaa ggtcttcaag tatttgttgt 
acgtggactc tagcaattca ctgttgtaat 
tttataatag atgactaatt cccaattgat 
gccttatcag ccattatcaa aggatctagc 
tatagcattg gatgtagatc tggagttaat 
aactcttcca gttcaacaca gtgtaaaatt 
ttacagttct cacaccctat agagactttt 
ttcattaaca ttagaaacac ttatgttata 
tttttttttt ttttttgaga cggagtttcg 
acgatcttgg ctcactgcac cctctgcccc 
acctgagtag ctgggattac aggcacctgc 
tagtagagat ggggtttcac catgttggcc 
acctgcccgc ctcggcctcc caaagtgctg 
ctacatcgtt cttaatacac aaatatacat 
taaccaaatt ctttgtttta taatatcttc 



aaccaaatat accacagtat attttacctt 3000 
ttttacttat ttattttcga gacagggtct 3060 
cacgatctgg gctcactgca acctccatct 3120 
ctcccaagta gctgggatta caggcacgca 3180 
agtaaagacg ggttttcacc atgttggcca 3240 
tgcccacctc ggcctcccaa agtgctggga 3300 
gcgtatgttt ttaaaaagag actcatattc 3360 
ctgggagact ttggtttatt gtgctaagct 3420 
aatcaaaaat gcaggaatac ataggctata 3480 
agaatgtaag ccagtaataa agaatgccta 3540 
ccctcactgt agtgctgtag aacagctgaa 3600 
gagaagcagg tgcctatatt tttacaaaat 3660 
aatacccttt tgtaaaactc atgtatgtaa 3720 
ctaatatatg gtgctcatga ttgatcagat 3780 
tatatgccag aagaggtaaa aatgaagaaa 3 84 0 
ccttgctaaa gcattagttg tcattagcag 3 900 
taaattgtgt gccttatgtt cagcagttcc 3960 
aagatttttt gtttcagagg atgttacact 4020 
aagttgattc tgtatagtca cacttgagaa 4080 
attagttgag aaacattgcg ttatctggaa 4140 
atagtagtga ctacacagta gtgttacatt 4200 
gtattaacaa aataagaggc tcaaaggtta 4260 
ttacattgca tcggtctttt ctgttttttg 4320 
cttttgttgc ccaggctgga gtgcaatggt 4380 
ctggattcaa gcgattctct tgcctcagcc 4440 
caccacaccc agctaatttt ttttcatttt 4500 
aggctggtct cgaactcctg acctcaggtg 4560 
ggattacagg catgagccgc cacacctggc 4620 
cagttactcc acagcgcttg atatgggagg 4680 
ataattaatt aaaaaactaa gtcgacattt 4740 
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ttaatcacct ttaataattt gccaaaatat tatataagca taatataatc aattcttact 4800 
tactccaaca aattttaaaa gtccagatac agataccata tctagtttct tgatcattta 4860 
tatcagctcc catacagaag ccttctaaat ctctggtaat ttcactttgc tgtttatata 4920 
agtgttggct catgactacc ttgttcttct tgaaatgatg ttttatagcc ttgaattggc 4980 
tgaaataatc aagtgtacaa ttgagagatg ccctgaaaac agcttaaaat aaaatatgca 5040 
catctactag gaaattagta ccaacacatg aatctgtctg atgggcagat attaggaatg 5100 
aagtcactcc agatctgaga aattaaagtt gtaaaggact gcaagttctg tgtttttgtt 5160 
gttgttgttg ttgttgttgt tgtttgtttt ttcatttttg ttttttgggt ttttttgaga 5220 
cagagtctca ttctgtcacc caggctgtag tgcagtggca cgacctcaac tcactgcaac 5280 
ctccgtctcc, caggttcaag cgattctcct gtctcagctg ggaccacagg cacacgctat 5340 
cacacccagc naatttttgt atttttagta gagacagggt ttcaccatgt tagccaggct 5400 
ggtctcgaac tcctgacctc aagtgatctg cccgtctcgg cctcccaaag tgctgggatt 5460 
acaggcctga gacaccatgc ccagcatttt tttttttttt tttttttttt gtaaagagac 5520 
aaggtttcac ttgtccaggc caagtgcagt ggcatgatca tagctctgta acctgacctc 5580 
tgacctctga cttcctggac acaagtgatc ctcctgtctc tcagcctccc aagtagctgg 5640 
gaccacaggc attccaccac acccaactaa ttgtttttat tttttgtaga gacagggcct 5700 
tgctatgttg cccaggctgg caagttcttg aaataatggc tgtggccaca aactagaaaa 5760 
taattttcag gtgtacagag aatagaaaga atttagattc ataaattgat cattttgttc 5820 
acagttattt gcataacaca gttcacattt aaaggtgtca ccctagaaat caaaggggaa 5880 
gaacatcatc ctctattgaa aaagaaagaa atcaaaggat gtacagtgaa tttgcagctt 5940 
aatctatggg gagcatcatt gcaaaaaacg gttctgtgtg aggctcrttc ccaccctttg 6000 
tccataggag cacattattg ttgtagtaat tatttcaccc ctctcccttt ttcagtgtac 6060 
aagtgataca tgctaatttt. aacagaactt gaaagtagaa taaaattaaa ataatagttt 612 0 
actaatattc catttatctt ctctcatata tatgagataa atattaaggt: gtatgtactt 6180 
atccatatgt gcctgatttt tcaaaatcct tgtatatgca tctttgcacc cttatctaat 6240 
tatttcctta gaatatattc ctagaagcat aattgtggga acaaaggcca tgaacatttt 6300 
caagtgttta ttttattatt ttattttatt tttattaatt ttgatacagg gttttgcttt 6360 
gttccccaga ctggagtgca gtggtgagat caccacccac tgcaccttga cctcctggac 6420 
tcaagcgatc cacctgcctc agtctcctca gtagcggggg ctaaggacta caggcacatg 6480 
ccatcatgcc cagctaattt ttttatttgc agcagagacg aggtctcact gtgttgccca 6540 
ggctgctatt ttatttattt tttaagagat agggtctcat tctgtcttcc aggctagaat 6600 
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gcagtggcac aatcatagct cactgcaacc tcaagcgatc tttgcctcag cctgagtagc 6660 
tgggactaca ggcatgggcc accactctca gctaattttt ttttcaattt tttatttttt 6720 
gtagatatgg gggtctcact gtgttgccta ggctggtctt gaacccctag cctaaagtga 6780 
tcttcccacc tcagcctccc aaagtgctag gattacaggc cacaggcctc agccaagttt 6840 
taaaaatttt tactgccaaa ctcttcatta gaaaagttga accagcttac attcccaggc 6900 
cagttttcta ttgatatagt agcactgaat attataattc agttaacttt tgtcaatacg 6960 
gtaggctaaa agtgctatgt tcttagccat ctctcttttg ggttaacagt gcactatttt 7020 
gttattaata attattctat ctaacaagcc ccctctatgg ttttgtggct ttgtagtaag 7080 
catagttgta tttccttttt tgaggtggag tcttgctatg ttgcccaggc tggagtgcag 7140 
tggcgcgatc tcggctcact gcaccctccg cctcccgggt tcaagtgatt ctcctgcctc 7200 
agactcctga gcatctggga ctacaggcat gcaccaccac gcccagctaa ttttttatat 7260 
ttttagtaga gaggggagtt caccgtgtta gccgggatgg tctctatctc ttgacctcgt 7320 
ggtccgcgtg cctcagcctc ccaaaatgct gtgattacag gcatgagcca ccctgcctgg 7380 
ccaacatttc ttttacatgc ataaaagaga tctgagctgt ttttgagccc ttctagactt 7440 
tctttttttt tttttttctt tttttttttt tttttttttt ttctttttaa gtagacgagg 7500 
tcttgctatg ttgccgagac ttaacctcaa actcctaggc ccaagcaatc ctcccaagct 7560 
gctgggacta caggcatgaa ccaccatgcc caacttagac ttttattgta ctatcaaaag 7620 
gcaattttct tttcaaattt ctgggtaata gtgttagaaa aatcctactt ggtaacatcc 7680 
agaaatggca tcatactgag tgattcaaat gtgagatgga agaaaaggtt agaattggag 774 0 
tgaacgtccc ctcttatctc aaatgtattt tatctccatt ttgtttcata gtttattagt 7800 
ttgaagatgc tttgaatgtc acctaatcat tttcaactct aggtccagaa aaatcaaggg 7860 
catgatttct gaaattacac ttagcctaat taaaacttag aaacactgtt caccttcttc 7920 
aatgtttttg actgagtctt tttcatttat aagtgacagg aggtgttact ataacattat 7980 
ttcctagaat gtcaaatttt gagcctaata gcatggtaaa tttggctata tttgttgttt 8040 
tttgtttttg tttttttttt aatgaaactt agtatttcct tgtttcccac ttcttttttt 8100 
tttttttttt tttttttttt tgagacggag tctctctctg tcatccaggc tggagtgcaa 8160 
tggcgtgatc ttggctcact gccacctccg cctcgcaggt tcacgctatt ctcctttcac 8220 
agcctcctga gtagctggga ctacaggcac ccaccaccac gcccggccaa tttttttgta 8280 
tttttagtag agacggggtt ttaccatgtt aggcaggatg gtctcgaact cctgaccttg 8340 
tgatctgccc gcctcagcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 8400 
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gcctcccact tctttttaat atgtcgtgtc ataactgaac agcaaagtga gcagattatc 8460 
aggttaaatc tgaagtgtca gtctggtcac cagtgcccaa gttactgccc ctatggtaat 8520 
attggttact ttgtattttc ctacagcaaa cataaaattt gttatagtga gatttttacc 8580 
tgtatacctc tcttaacttt aatgttatta cctcaaggaa gatattatca tgaatgaaga 8640 
ttccatgatg aaagttttgc agagtttatt gcagtaattt agtacttcat tagaatcttt 8700 
agttttttag gagcacagta ctgaatgttt gtttctttgt tggacctttt gaaaaccggt 8760 
tttccattga tgcagtgtag ctgttacagg aatatcattt ttaaaacgtt tttatacagc 8820 
atggctgaaa attgaacctg ggcctccctc gtggcctacc attgaaggaa cagcattttt 8880 
tgcctatcta gaaagacaat gttaaatgtg ctatctatat atttcttaac ttgtgctacc 8940 
tactacgcgt ttatatttgt ggaatctgtt ttcttttgga caaaaccaca aatcaaaaac 9000 
acctcatttc ttaggcattt gaaatcccta attcagaata atctcccaaa cagaaacaca 9060 
actacctgca ttctttttga caaaagagct. aagtagcatt agaaaattat tttaaaccca 9120 
attctgtttt ttaacagaat aaaattcttc tgttcttcac attcttcttt cataggtaac 9180 
ctattgaaag tagggtttat ttgggggaag catttctttc tgtctcttat ctcataataa 9240 
atacaggtgt gcttaactac tagtttccta cctcaaagat atactcaaat ctaaagatgt 9300 
ttaagatttt gggatctgaa gagtaaacat ttctcctaat cacaatgtga cagagacaaa 9360 
tgaatcaagc caatgctact tttatttatg catactaact ggaacttttc tttttggaaa 9420 
tcagatacat tttgtatgta ttagtaattt ggaatcctgc attggttatc ctcgccctcc 9480 
caaagcagat tctgaaatta taaaggtgca caggttctcc atgcaacacc aaaagttata 9540 
ttttccaagg ctttgtaaaa ttgtagaatg tcctgttaaa tttctgtcaa atcagtaact 9600 
cacactgttt tgagaattat gaataaagga ataaaatatt gttagtgttt atttagtaca 9660 
aaagtagatt atagaatctc agcatttttg tcaaaaaatt tctttttgat gattgacaga 9720 
tcaggagaca cttaaggcca tacctgcttt cagtaatcaa aaatgcattt aagatccaga 9780 
aacttgaggt agcagaacat cactatcaca tataacatat cctttggtat agaaaattat 9840 
attcccagag tgagtttctt ttttaaaacc attaatgagg ccaaggtggg aagatcactt 9900 
gggaccagga gttcaagacc aagcctgggc cagatggcga gaccctgtct ctacaaaaaa 9960 
ttaactggat gtggtggtgc actcctgtag tcccacctac tcagaggctg aggcaggagg 10020 
atcccttgag cccaggaaat tgtagtggca gtgagctatg atcatactac tgtactgcag 10080 
tctgggccac gaagtgagac cgtgtctctt aaaaaaaaaa aaatgttagg catggtggca 10140 
caggcatata gttttagcta cttaggaggc tgaggcagga ggatcacttg agcccagaag 10200 
ttcaagatta cagtgagtta tgattgtgcc gctgcactcc aacctgggtg acaaaataac 10260 

60 



S DOC ID <WO 0050607A2J_> 



WO 00/50607 



PCT/IBOO/00324 



cctgtctctg gcgggtaggg gggaagttga ttatttactt tgaaatatgt tcaaaactga 10320 
ttcctgttct atattcctaa tgaacagaat agactttata taaaacaaat agttaaactt 10380 
aaggataaaa ttttaatgga agtataatat atatatcttc cagctcttct gtcttctaat 10440 
gtatttatta cagaaaatga aattactttg tttccgcaat ctttgtatca cttcagttct 10500 
ccaataaatc tgagaattct ggtagtgtga aatattcagc tttctttgct tatttacata 10560 
aaatgtataa ggacaatttg tgataattaa gagttacatt taaatatcag gaaaaagtta 10620 
taaatttaaa ttaaaaaatt ttaaaaggaa attattagaa attttaaaag aatgaactaa 10680 
aaggtgatta tatgtaaatg cttgcatata tgaatattag cattgtcccc aaaataattt 10740 
agaacaaaga aattggaatc aaataaataa aggtttgatt atttttaaat tggcttatat 10800 
tccatgataa aagagaggtt tatcagtggc ataagaaagg tttttcacct tttttgtatt 10860 
gaaatctttg acatatacat atatatcttt gctcatcttt gtgtatcttt gctcgtatga 10920 
gagcaaagat ataggcaaag atatgctctc tctctctatg tctttgttca taccaagacc 10980 
ttcctgatat ctccacataa tcttaaatat aggaacatta gactggatga tctctgtgcc 11040 
ccctttatct ctactcttcc attattttat actttaacac atcatctctg ttttatgata 11100 
taagaatgga atatttcttt tttcctgaaa atgcttattt tggtcacttg atacacatta 11160 
ggccaatatg tgttacttga gtgacccatc ttccttcttt tcatttctgt ctcctgtcat 11220 
taacctggat atctggaatg tggactaaac tcttcaaaca ctatgtaaaa cctactaacc 112 80 
tttgtgcatt tggttgctca gctactaaga gcaccatttc tgaactgaag ttaactgaag 11340 
accattctgt tttagagatt atgacatacc ttttggattc tcatgccttt ttcctccctt 11400 
ctcaaggttg aagagatggt gcagaaccac atgacttact cattacagga tgtaggcgga 11460 
gatgccaatt ggcagttggt tgtagaagaa ggagaaacga aggtaattcc ccctgaaatg 1152 0 
ttatagattg ccaaaggcgt ctctgtttca gtcatattat cattactatt gatatgaata 11580 
aggatagcac tttcaactta cctttaaaac aaattattac atgtgatcaa agcagtacca 11640 
tatattgagc aataaaatgt ctttttgctt ttctggcttt gcctttacta aaggttttta 11700 
tgattataat ataaatatat gattaaacct ttctgttttg actaggccat gaagaaaata 11760 
aaatttagag aattagatat gaccaggtca caattagctg atggtcctgt atttggatat 11820 
ttccttttgt tttgtttttt taacatactg aatgttgtgc ctagatgaca ctttgtttct 11880 
ctcccttttt ggtctatacc ctccttcttt tcccttctct tactgcacct ttaattgata 11940 
tttggacatt ggtcagttaa tcctggttac acccctaaac acatggacag aaaataagag 12000 
cagggactga gagatacaga gatggattga aaagcaaaag caacattgaa ttttggattt 12 060 
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tctcattcct aaggaactat gctaaataaa gatacaaaga taataagaca ctctccaagc 12120 
taaagcttta gttaaggaaa aagaatattg acatttaaaa gatactattg gccaggcaca 12180 
gtggctatgc ctgtaatccc agcactttta ggaggacatg gcaggcggat tacttgagct 1224 0 
caggagttca agtcaaacct gggcaacacg gtgaaacccc gtctctacca aaaatacaaa 12300 
aattagctgg gtgcagtacc acacacttgt agtcccagcc acccaggagg ctgggcaaaa 12360 
gattccttga gccagggagg tcaaggctgc aatgagccgc gtttgtgcca ctgcactcta 12420 
gcctgggtca caaagtgaga ccctgtgtga gatatatata tatatatata tatatatata 12480 
ta 12482 



<210> 26 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPpepl 
<400> 26 

Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr Thr Arg 
1 " 5 10 15 

Gly Phe Val Phe Thr 
20 



<210> 27 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPpeplAla9 
<400> 27 

Lys Gly Lys Arg Gly Asp Ala Gly Ser Pro Ala Thr Trp Thr Thr Arg 
! ' ~ 5 10 15 

Gly Phe Val Phe Thr 
20 



<210> 28 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-54m 
<400> 28 

tcgaattcac catggcccca ctagccgact acaaggacga cgatgacaag 50 
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<210> 29 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-55c 
<400> 29 

ccgagcccga cgagttccag ctctgattat ccgacatctt gtcatcgtcg 50 

<210> 30 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-HNC-B-N- 14m 
<400> 30 

cgggatccgc tagctaagcc aggcaaggat gg 32 

<210> 31 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-HNC-B-N-16C 
<400> 31 

cgggatccat gcataaatag cagttctgct gt 32 



<210> 32 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG peptide 
<400> 32 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 33 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Hypothetical 
peptide 

<400> 33 

Pro Arg Ser Ala Arg Cys Gin Ala Arg Arg Arg Arg Gly Gly Arg Thr 
1 5 10 15 
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Ser Ser 



<210> 34 
<211> 25 
^212 > DNA 

<213> Artificial Sequence 
O20> 

<223> Description of Artificial Sequence : ON-GPBP-llm 
<400> 34 

gcgggactca gcggccggat tttct 25 



<210> 35 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: ON-GPBP-15m 
<400> 35 

acagctggca gaagagac 



<210> 36 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP- 20c 
<400> 36 

catgggtagc ttttaaag 18 



<210> 37 
<211> 27 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-22m 
<400> 3 -1 

tagaagaaca gtcacagagt gaaaagg 



<210> 38 
<211> 23 
<212> DNA 

c213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-53c 
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<400> 38 

gaattcgaac aaaataggct ttc 23 



<210> 39 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-56m 
<400> 39 

ccctatagtc gctcttc 17 



<210> 40 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

*.223 > Description of Artificial Sequence: ON-GPBP-57c 
v400> AO 

ctgggagctg aatctgt 17 



v 2 1 C > 4 1 
v211 > 24 

-212* d:;a 

v213> Artificial Sequence 
- 220 - 

-223 ^ Description of Artificial Sequence: ON-GPBP-62c 
<400- A I 

gtggttctgc accatctctt caac 24 



2NA 

Art if icial Sequence 
<220> 

<223> Description of Artificial Sequence: ON-GPBP-26 
<400> 42 

cacatagatt tgtccaaaag gttgaagaga tggtgcagaa c 41 



-2 10 > 

■ 2 1 : > 

<212> 

v213> 



<210> 43 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPIII derived 
peptide 
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<400> 43 

Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu Phe Val Lys Val Leu 
15 10 15 

Arg Ser Pro 



<210> 44 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPIII-IV-V 
derived peptide 

<400> 44 

Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu Phe His Gin 
1 ~ 5 10 



<210> 45 
<211> 685 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDV 

<220> 

<221> CDS 

<222> (1) . . (633) 

<400> 45 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 4 8 
Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 



acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gta caa gga aat caa cga gec cac gga caa gac ctt gga act ctt 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Gly Thr Leu 
50 55 60 

ggc age tgc ctg cag cga ttt acc aca atg cca ttc tta ttc tgc aat 
Gly Ser Cys Leu Gin Arg Phe Thr Thr Met Pro Phe Leu Phe Cys Asn 
65 70 75 80 

gtc aat gat gta tgt aat ttt gca tct cga aat gat tat tea tac tgg 
Val Asn Asp Val Cys Asn Phe Ala Ser Arg Asn Asp Tyr Ser Tyr Trp 
85 90 95 

ctg tea aca cca get ctg atg cca atg aac atg get ccc att act ggc 
Leu Ser Thr Pro Ala Leu Met Pro Met Asn Met Ala Pro He Thr Gly 



96 



144 



192 



240 



288 



336 
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100 105 110 

aga gcc ctt gag cct tat ata age aga tgc act gtt tgt gaa ggt cct 384 
Arg Ala Leu Glu Pro Tyr lie Ser Arg Cys Thr Val Cys Glu Gly Pro 
115 120 125 

gcg ate gcc ata gcc gtt cac age caa ace act gac att cct cca tgt 432 
Ala lie Ala lie Ala Val His Ser Gin Thr Thr Asp lie Pro Pro Cys 
130 135 140 

cct cac ggc tgg att tct etc tgg aaa gga ttt tea ttc ate atg aaa 480 
Pro His Gly Trp lie Ser Leu Trp Lys Gly Phe Ser Phe lie Met Lys 
145 150 155 160 

gcc tat tec ate aac tgt gaa age tgg gga att aga aaa aat aat aag 52 8 
Ala Tyr Ser lie Asn Cys Glu Ser Trp Gly lie Arg Lys Asn Asn Lys 
165 170 175 

teg ctg tea ggt gtg cat gaa gaa aag aca ctg aag eta aaa aag aca 576 
Ser Leu Ser Gly Val His Glu Glu Lys Thr Leu Lys Leu Lys Lys Thr 
180 185 190 

gca gaa ctg eta ttt ttc ate eta aag aac aaa gta atg aca gaa cat 624 
Ala Glu Leu Leu Phe Phe lie Leu Lys Asn Lys Val Met Thr Glu His 
195 200 205 

get gtt att taggtatttt tctttaacca aacaatattg ctccatgatg 673 
Ala Val lie 
210 

acttagtaca aa 685 



<210> 46 
<211> 211 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDV 
<400> 46 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Gly Thr Leu 
50 55 60 

Gly Ser Cys Leu Gin Arg Phe Thr Thr Met Pro Phe Leu Phe Cys Asn 
65 70 75 80 

Val Asn Asp Val Cys Asn Phe Ala Ser Arg Asn Asp Tyr Ser Tyr Trp 
85 90 95 

Leu Ser Thr Pro Ala Leu Met Pro Met Asn Met Ala Pro He Thr Gly 
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100 105 110 

Arg Ala Leu Glu Pro Tyr lie Ser Arg Cys Thr Val Cys Glu Gly Pro 
115 120 125 

Ala lie Ala lie Ala Val His Ser Gin Thr Thr Asp lie Pro Pro Cys 
130 135 140 

Pro His Gly Trp lie Ser Leu Trp Lys Gly Phe Ser Phe lie Met Lys 
14*5 150 155 160 

Ala Tyr Ser lie Asn Cys Glu Ser Trp Gly lie Arg Lys Asn Asn Lys 
165 170 175 

Ser Leu Ser Gly Val His Glu Glu Lys Thr Leu Lys Leu Lys Lys Thr 
180 185 190 

Ala Glu Leu Leu Phe Phe lie Leu Lys Asn Lys Val Met Thr Glu His 
195 200 205 

Ala Val He 
210 



<210> 47 
<211> 680 
<212> DNA 

<213> Artificial Sequence 
v220> 

^223-* Description of Artificial Sequence: GPDIII 

v220> 

< 2 2 1 > CDS 

v222> (1) • - (216) 

<400> 47 

ggt trg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 

Gly Leu Lvs Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
: 5 10 15 

acg aaa ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gca caa gga aat caa cga gee cac gga caa gac ctt gat gca ctg 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 



48 



96 



144 



192 



ttt gtg aag gtc ctg cga teg cca tagcegttea cagccaaacc actgacattc 246 
Phe Val Lys Val Leu Arg Ser Pro 
65 * 70 

ctccatgtcc teaeggctgg atttctctct ggaaaggatt ttcattcatc atgttcacaa 306 

gtgcaggttc tgagggcacc gggcaagcac tggcctcccc tggctcctgc ctggaagaat 366 
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tccgagccag cccatttcta gaatgtcatg gaagaggaac gtgcaactac tattcaaatt 426 

cctacagttt ctggctggct tcattaaacc cagaaagaat gttcagaaag cctattccat 486 

caactgtgaa agctggggaa ttagaaaaaa taataagtcg ctgtcaggtg tgcatgaaga 546 

aaagacactg aagctaaaaa agacagcaga actgctattt ttcatcctaa agaacaaagt 606 

aatgacagaa catgctgtta tttaggtatt tttctttaac caaacaatat tgctccatga 666 

tgacttagta caaa 680 

<210> 48 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII 
<400> 48 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
15 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

Phe Val Lys Val Leu Arg Ser ' Pro 
65 70 



<210> 49 
<211> 392 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-IV-V 

<220> 

<221> CDS 

<222> (1) . . (207) 

<400> 49 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 4 8 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 

15 10 15 

acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 



69 



;DOCID: <WO 0050607A2_L> 



WO 00/50607 



PCT/IB00/00324 



35 40 45 

ttt gta caa gga aat caa cga gcc cac gga caa gac ctt gaa age eta 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu 
50 55 60 

ttc cat caa ctg tga aagctgggga attagaaaaa ataataagtc gctgtcaggt 247 
Phe His Gin Leu 
65 

gtgcatgaag aaaagacact gaagctaaaa aagacagcag aactgetatt tttcatccta 307 
aagaacaaag taatgacaga acatgctgtt atttaggtat ttttctttaa ccaaacaata 367 
ttgctccatg atgacttagt acaaa 392 



<210> 50 
<211> 68 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-IV-V 
<400> 50 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
1 5 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Glu Ser Leu 
50 55 60 

Phe His Gin Leu 
65 



<210> 51 
<211> 507 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-V 

<220> 

<221> CDS 

<222> (1) . . (216) 

<400> 51 

ggt ttg aaa gga aaa cgt gga gac agt gga tea cct gca acc tgg aca 4 8 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
1 5 10 15 

acg aga ggc ttt gtc ttc acc cga cac agt caa acc aca gca att cct 96 
Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala He Pro 
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20 25 30 

tea tgt cca gag ggg aca gtg cca etc tac agt ggg ttt tct ttt ctt 144 
Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

ttt gta caa gga aat caa cga gee cac gga caa gac ctt gat gca ctg 192 
Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

ttt gtg aag gtc ctg cga teg cca tagcegttea cagccaaacc actgacattc 246 
Phe Val Lys Val Leu Arg Ser Pro 
65 70 

ctccatgtcc teaeggctgg atttctctct ggaaaggatt ttcattcatc atgaaagect 306 

attccatcaa ctgtgaaagc tggggaatta gaaaaaataa taagtcgctg tcaggtgtgc 366 

atgaagaaaa gacactgaag ctaaaaaaga cagcagaact gctatttttc atcctaaaga 426 

acaaagtaat gacagaacat gctgttattt aggtattttt ctttaaccaa acaatattgc 486 

tccatgatga cttagtacaa a 507 

<210> 52 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPDIII-V 
<400> 52 

Gly Leu Lys Gly Lys Arg Gly Asp Ser Gly Ser Pro Ala Thr Trp Thr 
IS 10 15 

Thr Arg Gly Phe Val Phe Thr Arg His Ser Gin Thr Thr Ala lie Pro 
20 25 30 

Ser Cys Pro Glu Gly Thr Val Pro Leu Tyr Ser Gly Phe Ser Phe Leu 
35 40 45 

Phe Val Gin Gly Asn Gin Arg Ala His Gly Gin Asp Leu Asp Ala Leu 
50 55 60 

Phe Val Lys Val Leu Arg Ser Pro 
65 70 

<210> 53 
<211> 659 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HMBP-21 

<220> 

<221> CDS 

<222> (37) . . (627) 
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<400> 53 

gaaaacagtg cagccacctc cgagagcctg gatgtg atg gcg tea cag aag aga 54 

Met Ala Ser Gin Lys Arg 

1 5 

ccc tec cag agg cac gga tec aag tac ctg gee aca gca agt ace atg 102 

Pro Ser Gin Arg His Gly Ser Lys Tyr Leu Ala Thr Ala Ser Thr Met 
10 15 20 



gac cat gee agg cat ggc ttc etc cca agg cac aga gac acg ggc ate 
Asp His Ala Arg His Gly Phe Leu Pro Arg His Arg Asp Thr Gly lie 
25 30 35 



150 



ctt gac tec ate ggg cgc ttc ttt ggc ggt gac agg ggt gcg cca aag 198 
Leu Asp Ser lie Gly Arg Phe Phe Gly Gly Asp Arg Gly Ala Pro Lys 
40 45 50 

egg ggc tct ggc aag gta ccc tgg eta aag ccg ggc egg age cct ctg 246 
Arg Gly Ser Gly Lys Val Pro Trp Leu Lys Pro Gly Arg Ser Pro Leu 
55 60 65 70 

ccc tct cat gee cgc age cag cct ggg ctg tgc aac atg tac aag gac 
Pro Ser His Ala Arg Ser Gin Pro Gly Leu Cys Asn Met Tyr Lys Asp 
75 80 85 

tea cac cac ccg gca aga act get cac tat ggc tec ctg ccc cag aag 
Ser His His Pro Ala Arg Thr Ala His Tyr Gly Ser Leu Pro Gin Lys 
90 95 100 

tea cac ggc egg acc caa gat gaa aac ccc gta gtc cac ttc ttc aag 
Ser His Gly Arg Thr Gin Asp Glu Asn Pro Val Val His Phe Phe Lys 
105 110 115 

aac att gtg acg cct cgc aca cca ccc ccg teg cag gga aag ggg aga 
Asn lie Val Thr Pro Arg Thr Pro Pro Pro Ser Gin Gly Lys Gly Arg 
120 125 130 

gga ctg tec ctg age aga ttt age tgg ggg gee gaa ggc cag aga cca 
Gly Leu Ser Leu Ser Arg Phe Ser Trp Gly Ala Glu Gly Gin Arg Pro 
135 140 145 150 

gga ttt ggc tac gga ggc aga gcg tec gac tat aaa teg get cac aag 534 
Gly Phe Gly Tyr Gly Gly Arg Ala Ser Asp Tyr Lys Ser Ala His Lys 
155 160 165 

gga ttc aag gga gtc gat gee cag ggc acg ctt tec aaa att ttt aag 5 82 
Gly Phe Lys Gly Val Asp Ala Gin Gly Thr Leu Ser Lys lie Phe Lys 
170 175 180 

ctg gga gga aga gat agt cgc tct gga tea ccc atg get aga cgc 
Leu Gly Gly Arg Asp Ser Arg Ser Gly Ser Pro Met Ala Arg Arg 
185 190 195 

tgaaaaccca cctggttccg gaatcctgtc ct 659 



294 



342 



390 



438 



486 



627 



<210> 54 
<211> 197 
<212> PRT 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence: HMBP-21 



<400> 54 
Met Ala Ser Gin 
1 

Ala Thr Ala Ser 
20 

His Arg Asp Thr 
35 

Asp Arg Gly Ala 
50 

Pro Gly Arg Ser 
65 

Cys Asn Met Tyr 



Gly Ser Leu Pro 
100 

Val Val His Phe 
115 

Ser Gin Gly Lys 
130 



Lys Arg Pro Ser 
5 

Thr Met Asp His 



Gly lie Leu Asp 
40 

Pro Lys Arg Gly 
55 

Pro Leu Pro Ser 
70 

Lys Asp Ser His 
85 

Gin Lys Ser His 



Phe Lys Asn lie 
120 

Gly Arg Gly Leu 
135 



Gin Arg His Gly 
10 

Ala Arg His Gly 
25 

Ser lie Gly Arg 



Ser Gly Lys Val 
60 

His Ala Arg Ser 
75 

His Pro Ala Arg 
90 

Gly Arg Thr Gin 
105 

Val Thr Pro Arg 



Ser Leu Ser Arg 
140 



Ser Lys Tyr Leu 
15 

Phe Leu Pro Arg 
30 

Phe Phe Gly Gly 
45 

Pro Trp Leu Lys 



Gin Pro Gly Leu 
80 

Thr Ala His Tyr 
95 

Asp Glu Asn Pro 
110 

Thr Pro Pro Pro 
125 

Phe Ser Trp Gly 



Ala Glu Gly Gin Arg Pro Gly Phe 

145 150 

Tyr Lys Ser Ala His Lys Gly Phe 
165 

Leu Ser Lys lie Phe Lys Leu Gly 
180 



Gly Tyr Gly Gly Arg Ala Ser Asp 
155 160 

Lys Gly Val Asp Ala Gin Gly Thr 
170 175 

Gly Arg Asp Ser Arg Ser Gly Ser 
185 190 



Pro Met Ala Arg Arg 
195 
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